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Introduction 


An elliptic curve over Q is said to be modular if it has a finite covering by 
a modular curve of the form Xo(N). Any such elliptic curve has the property 
that its Hasse-Weil zeta function has an analytic continuation and satisfies a 
functional equation of the standard type. If an elliptic curve over Q with a 
given j-invariant is modular then it is easy to see that all elliptic curves with 
the same j-invariant are modular (in which case we say that the j-invariant 
is modular). A well-known conjecture which grew out of the work of Shimura 
and Taniyama in the 1950’s and 1960’s asserts that every elliptic curve over Q 
is modular. However, it only became widely known through its publication in a 
paper of Weil in 1967 [We] (as an exercise for the interested reader!), in which, | 
moreover, Weil gave conceptual evidence for the conjecture. Although it had 
been numerically verified in many cases, prior to the results described in this 
paper it had only been known that finitely many j-invariants were modular. 

In 1985 Frey made the remarkable observation that this conjecture should 
imply Fermat’s Last Theorem. The precise mechanism relating the two was 
formulated by Serre as the €-conjecture and this was then proved by Ribet in 
the summer of 1986. Ribet’s result only requires one to prove the conjecture 
for semistable elliptic curves in order to deduce Fermat’s Last Theorem. 


*The work on this paper was supported by an NSF grant. 
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Our approach to the study of elliptic curves is via their associated Galois 
representations. Suppose that pp is the representation of Gal(Q/Q) on the 
p-division points of an elliptic curve over Q, and suppose for the moment that 

p3 is irreducible. The choice of 3 is critical because a crucial theorem of Lang- 

lands and Tunnell shows that if p3 is irreducible then it is also modular. We 
then proceed by showing that under the hypothesis that p3 is semistable at 3, 
together with some milder restrictions on the ramification of p3 at the other 
primes, every suitable lifting of p3 is modular. To do this we link the problem, 
via some novel arguments from commutative algebra, to a class number prob- 
lem of a well-known type. This we then solve with the help of the paper [TW]. 
This suffices to prove the modularity of E as it is known that E is modular if 
and only if the associated 3-adic representation is modular. 

The key development in the proof is a new and surprising link between two 
strong but distinct traditions in number theory, the relationship between Galois 
representations and modular forms on the one hand and the interpretation of 
special values of L-functions on the other. The former tradition is of course 
more recent. Following the original results of Eichler and Shimura in the 
1950’s and 1960’s the other main theorems were proved by Deligne, Serre and 
Langlands in the period up to 1980. This included the construction of Galois 
representations associated to modular forms, the refinements of Langlands and 
Deligne (later completed by Carayol), and the crucial application by Langlands 
of base change methods to give converse results in weight one. However with 
the exception of the rather special weight one case, including the extension by 
Tunnell of Langlands’ original theorem, there was no progress in the direction 
of associating modular forms to Galois representations. From the mid 1980’s 
the main impetus to the field was given by the conjectures of Serre which 
elaborated on the €-conjecture alluded to before. Besides the work of Ribet and 
others on this problem we draw on some of the more specialized a a 
of the 1980’s, notably those of Hida and Mazur. 

The second tradition goes back to the famous analytic dass number for- 
mula of Dirichlet, but owes its modern revival to the conjecture of Birch and 
Swinnerton-Dyer. In practice however, it is the ideas of Iwasawa in this field on 
which we attempt to draw, and which to a large extent we have to replace. The 
principles of Galois cohomology, and in particular the fundamental theorems 
of Poitou and Tate, also play an important role here. 

The restriction that p3 be irreducible at 3 is bypassed by means of an 
intriguing argument with families of elliptic curves which share a common 
ps. Using this, we complete the proof that all semistable elliptic curves are 
modular. In particular, this finally yields a proof of Fermat’s Last Theorem. In 
addition, this method seems well suited to establishing that all elliptic curves 
over Q are modular and to generalization to other totally real number fields. 

Now we present our methods and results in more detail. 
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Let f be an eigenform associated to the congruence subgroup T(N) of 
SL2(Z) of weight k > 2 and character x. Thus if T, is the Hecke operator 
associated to an integer n there is an algebraic integer c(n, f) such that T,f = 
c(n, f)f for each n. We let Ky be the number field generated over Q by the 
{c(n, f)} together with the values of x and let Op be its ring of integers. 
For any prime A of Oy let Of be the completion of Op at A. The following 
theorem is due to Eichler and Shimura (for k = 2) and Deligne (for k > 2). 
The analogous result when k = 1 is a celebrated theorem of Serre and Deligne 
but is more naturally stated in terms of complex representations. The image 
in that case is finite and a converse is known in many cases. 


THEOREM 0.1. For each prime p € Z and each prime | p of Of there 
is a continuous representation 


Pf, Gal(Q/Q) — GLo(O;,y) 


which is unramified outside the primes dividing Np and such that for all primes 
q t Np, 


trace pf (Frobq) =c(q,f), det pz,,(Frob q) = x(q)q*™". 


We will be concerned with trying to prove results in the opposite direction, 
that is to say, with establishing criteria under which a A-adic representation 
arises in this way from a modular form. We have not found any advantage 
in assuming that the representation is part of a compatible system of \-adic 
representations except that the proof may be easier for some 4 than for others. 

Assume 


po : Gal(Q/Q) — GL2(Fp) 


is a continuous representation with values in the algebraic closure of a finite 
field of characteristic p and that det po is odd. We say that po is modular 
if po and ps, mod A are isomorphic over F, for some f and A and some 
embedding of Of/A in F,. Serre has conjectured that every irreducible po of . 
odd determinant is modular. Very little is known about this conjecture except 
when the image of pp in PGLo (Fp) is dihedral, A, or S4. In the dihedral case 
it is true and due (essentially) to Hecke, and in the A, and S4 cases it is again 
true and due primarily to Langlands, with one important case due to Tunnell 
(see Theorem 5.1 for a statement). More precisely these theorems actually 
associate a form of weight one to the corresponding complex representation 
but the versions we need are straightforward deductions from the complex 
case. Even in the reducible case not much is known about the problem in 
the form we have described it, and in that case it should be observed that 
one must also choose the lattice carefully as only the semisimplification of 
PFa = pf, mod À is independent of the choice of lattice in K? ` 
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If O is the ring of integers of a local field (containing Qp) we will say that 
p : Gal(Q/Q) — GL2(0) is a lifting of po if, for a i embedding of the 
residue field of O in F,, p and po are isomorphic over F,. Our point of view 
will be to assume that po is modular and then to ean to give conditions 
under which a representation p lifting pọ comes from a modular form in the 
sense that p ~ pf, over Ky, for some f, A. We will restrict our attention to 
two cases: 


(I) po is ordinary (at p) by which we mean that there is a one-dimensional 
subspace of Fo stable under a decomposition group at p and such that 
the action on the quotient space is unramified and distinct from the 
action on the subspace. 


(II) po is flat (at p), meaning that as a representation of a decomposition 
group at p, po is equivalent to one that arises from a finite flat group 
scheme over Zp, and det po restricted to an inertia group at p is the 
cyclotomic character. 


We say Smily that p is ordinary (at p) if, viewed as a representation to Q2, 
there is a one-dimensional subspace of Q? stable under a decomposition group 
at p and such that the action on the quotient space is unramified. 

Let € : Gal(Q/Q) — Z denote the cyclotomic character. Conjectural 
converses to Theorem 0.1 have been part of the folklore for many years but 
have hitherto lacked any evidence. The critical idea that one might dispense 
with compatible systems was already observed by Drinfeld in the function field 
_ case [Dr]. The idea that one only needs to make a geometric condition on the 
restriction to the decomposition group at p was first suggested by Fontaine and 
Mazur. The following version is a natural extension of Serre’s conjecture which 
is convenient for stating our results and is, in a slightly modified form, the one 
proposed by Fontaine and Mazur. (In the form stated this incorporates Serre’s 
conjecture. We could instead have made the hypothesis that po is modular.) 


CONJECTURE. Suppose that p:Gal(Q/Q) — GL2(0) is an irreducible 
lifting of po and that p is unramified outside of a finite set of primes. There 
are two cases: 


(i) Assume that po is ordinary. Then if p is ordinary and det p =e*—1y for 
some integer k > 2 and some x of finite order, p comes from a modular 
form. 


(ii) Assume that po is flat and that p is odd. Then if p restricted to a de- 
composition group at p is equivalent to a representation on a p-divisible 
group, again p comes from a modular form. 
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In case (ii) it is not hard to see that if the form exists it has to be of 
weight 2; in (i) of course it would have weight k. One can of course enlarge 
this conjecture in several ways, by weakening the conditions in (i) and (ii), by 
considering other number fields in place of Q and by considering groups other 
than GL». | | | 

We prove two results concerning this conjecture. The first includes the 
hypothesis that pp is modular. Here and for the rest of the paper we will 
assume that p is an odd prime. | 


THEOREM 0.2. Suppose that po is irreducible and satisfies either (I) or 
(II) above. Suppose also that po is modular and that 


(i) po ts absolutely irreducible when restricted to Q ( (1) F p). 


(ii) If q = —l1modp is ramified in po then either po|p, is reducible over 
the algebraic closure where Dg is a decomposition group at q or poļi, is 
absolutely irreducible where Iq is an inertia group at q. 


Then any representation p as in the conjecture does indeed come from a mod- 
ular form. 


= The only condition which really seems essential to our method is the re- 
quirement that pọ be modular. 

The most interesting case at the moment is when p = 3 and pọ can be de- 
fined over F3. Then since PGL2(F3) ~ S4 every such representation is modular 
by the theorem of Langlands and Tunnell mentioned above. In particular, ev- 
ery representation into GL2(Z3) whose reduction satisfies the given conditions 
is modular. We deduce: 


THEOREM 0.3. Suppose that E is an elliptic curve defined over Q and 
that po is the Galois action on the 3-division points. Suppose that E has the 
following properties: 


(i) E has good or multiplicative reduction at 3. 
(ii) po ts absolutely irreducible when restricted to Q(V-3). 


(ih) For any g = —Lmad3 either polo, is reducible over the algebraic closure 
or po|r, is absolutely irreducible. 


Then E is modular. — 


We should point out that while the properties of the zeta function follow 
directly from Theorem 0.2 the stronger version that E is covered by Xo(N) 
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requires also the isogeny theorem proved by Faltings (and earlier by Serre when 
E has nonintegral j-invariant, a case which includes the semistable curves). 
We note that if E is modular then so is any twist of E, so we could relax 
condition (i) somewhat. | 

The important class of semistable curves, i.e., those with square-free con- 
ductor, satisfies (i) and (iii) but not necessarily (ii). If (ii) fails then in fact po 
is reducible. Rather surprisingly, Theorem 0.2 can often be applied in this case 
also by showing that the representation on the 5-division points also occurs for 
another elliptic curve which Theorem 0.3 has already proved modular. Thus 
Theorem 0.2 is applied this time with p = 5. This argument, which is explained 
in Chapter 5, is the only part of the paper which really uses deformations of 
the elliptic curve rather than deformations of the Galois representation. The 
argument works more generally than in the semistable case but in this setting 
we obtain the following theorem: 


THEOREM 0.4. Suppose that E is a semistable elliptic curve defined over 
Q. Then E is modular. 


More general families of elliptic curves which are modular are given in Chap- 
ter 5. 

In 1986, stimulated by an ingenious idea of Frey [Fr], Serre conjectured 
and Ribet proved (in [Ril]) a property of the Galois representations associated 
to modular forms which enabled Ribet to show that Theorem 0.4 implies ‘Fer- 
mat’s Last Theorem’. Frey’s suggestion, in the notation of the following theo- 
rem, was to show that the (hypothetical) elliptic curve y? = x(x + uP) (x — vP) 
could not be modular. Such elliptic curves had already been studied in [He] 
‘but without the connection with modular forms. Serre made precise the idea 
of Frey by proposing a conjecture on modular forms which meant that the rep- 
resentation on the p-division points of this particular elliptic curve, if modular, 
would be associated to a form of conductor 2. This, by a simple inspection, 
could not exist. Serre’s conjecture was then proved by Ribet in the summer 
of 1986. However, one still needed to know that the curve in question would 
have to be modular, and this is accomplished by Theorem 0.4. We have then 
(finally!): | 


THEOREM 0.5. Suppose that u?+v?+w? = 0 with u,v,w E€ Q and p > 3, 
then uvw = 0. 


a 


The second result we prove about the conjecture does not require the 
assumption that pọ be modular (since it is already known in this case). | 
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- THEOREM 0.6. Suppose that po ts irreducible and satisfies the hypotheses 
of the conjecture, including (I) above. Suppose further that 


(i) po = Ind? ko for a character ko of an imaginary quadratic extension L 
of Q which is unramified at p. 


(ii) det po|7, = w. 


Then a representation p as in the conjecture does indeed come from a modular 
form. 


This theorem can also be used to prove that certain families of elliptic 
curves are modular. In this summary we have only described the principal 
theorems associated to Galois representations and elliptic curves. Our results 
concerning generalized class groups are described in Theorem 3.3. 

The following is an account of the origins of this work and of the more 
specialized developments of the 1980’s that affected it. I began working on 
these problems in the late summer of 1986 immediately on learning of Ribet’s 
result. For several years I had been working on the Iwasawa conjecture for 
totally real fields and some applications of it. In the process, I had been using 
and developing results on ¢-adic representations associated to Hilbert modular 
forms. It was therefore natural for me to consider the problem of modularity 
from the point of view of -adic representations. I began with the assumption 
that the reduction of a given ordinary ¢-adic representation was reducible and 
tried to prove under this hypothesis that the representation itself would have 
to be modular. I hoped rather naively that in this situation I could apply the 
techniques of Iwasawa theory. Even more optimistically I hoped that the case 
l = 2 would be tractable as this would suffice for the study of the curves used 
by Frey. From now on and in the main text, we write p for £ because of the 
connections with Iwasawa theory. 

After several months studying the 2-adic representation, I made the first 
real breakthrough in realizing that I could use the 3-adic representation instead: 
the Langlands-Tunnell theorem meant that p3, the mod 3 representation of any 


given elliptic curve over Q, would necessarily be modular. This enabled me _ 


to try inductively to prove that the GL2(Z/3”" Z) representation would be 
modular for each n. At this time I considered only the ordinary case. This led 
quickly to the study of H*(Gal(F../Q), Wp) for i = 1 and 2, where F is the 
splitting field of the m-adic torsion on the Jacobian of a suitable modular curve, 
m being the maximal ideal of a Hecke ring associated to p3 and Wy the module 
associated to a modular form f described in Chapter 1. More specifically, I 
needed to compare this cohomology with the cohomology of Gal(Qs,/Q) acting 
on the same module. | | 

I tried to apply some ideas from Iwasawa theory to this problem. In my 
solution to the Iwasawa conjecture for totally real fields [Wi4], I had introduced 
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a new technique in order to deal with the trivial zeroes. It involved replacing 
the standard Iwasawa theory method of considering the fields in the cyclotomic 
Z,-extension by a similar analysis based on a choice of infinitely many distinct 
primes q; = 1modp™ with n; > co as i > oo. Some aspects of this method 
suggested that an alternative to the standard technique of Iwasawa theory, 
which seemed problematic in the study of W;, might be to make a comparison 
between the cohomology groups as © varies but with the field Q fixed. The 
new principle said roughly that the unramified cohomology classes are trapped 
by the tamely ramified ones. After reading the paper [Gre1], I realized that the 
duality theorems in Galois cohomology of Poitou and Tate would be useful for 
this. The crucial extract from this latter theory is in Section 2 of Chapter 1. 

In order to put these ideas into practice I developed in a naive form the 
techniques of the first two sections of Chapter 2. This drew in particular on 
a detailed study of all the congruences between f and other modular forms 
of differing levels, a theory that had been initiated by Hida and Ribet. The 
outcome was that I could estimate the first cohomology group well under two 
assumptions, first that a certain subgroup of the second cohomology group 
vanished and second that the form f was chosen at the minimal level for m. 
These assumptions were much too restrictive to be really effective but at least 
they pointed in the right direction. Some of these arguments are to be found 
in the second section of Chapter 1 and some form the first weak approximation 
to the argument in Chapter 3. At that time, however, I used auxiliary primes 
q = —l mod p when varying ¥ as the geometric techniques I worked with did 
not apply in general for primes q = 1modp. (This was for much the same 
reason that the reduction of level argument in [Ril] is much more difficult 
when q = 1modp.) In all this work I used the more general assumption that — 
Pp was modular rather than the assumption that p = 3. 

In the late 1980’s, I translated these ideas into ring-theoretic language. A 
few years previously Hida had constructed some explicit one-parameter fam- 
ilies of Galois representations. In an attempt to understand this, Mazur had 
been developing the language of deformations of Galois representations. More- 
over, Mazur realized that the universal deformation rings he found should be 
given by Hecke rings, at least in certain special cases. This critical conjecture 
refined the expectation that all ordinary liftings of modular representations 
should be modular. In making the translation to this ring-theoretic language 
I realized that the vanishing assumption on the subgroup of H? which I had 
needed should be replaced by the stronger condition that the Hecke rings were 
complete intersections. This fitted well with their being deformation rings 
where one could estimate the number of generators and relations and so made 
the original assumption more plausible. 7 

To be of use, the deformation theory required some development. Apart 
from some special examples examined by Boston and Mazur there had been 
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little work on it. I checked that one could make the appropriate adjustments to 
the theory in order to describe deformation theories at the minimal level. In the 
fall of 1989, I set Ramakrishna, then a student of mine at Princeton, the task 
of proving the existence of a deformation theory associated to representations 
arising from finite flat group schemes over Zp. This was needed in order to | 
remove the restriction to the ordinary case. These developments are described 
in the first section of Chapter 1 although the work of Ramakrishna was not 
completed until the fall of 1991. For a long time the ring-theoretic version 
of the problem, although more natural, did not look any simpler. The usual 
methods of Iwasawa theory when translated into the ring-theoretic language 
seemed to require unknown principles of base change. One needed to know the 
exact relations between the Hecke rings for different fields in the cyclotomic 
Z,-extension of Q, and not just the relations up to torsion. 

The turning point in this and indeed in the whole proof came in the 
spring of 1991. In searching for a clue from commutative algebra I had been 
particularly struck some years earlier by a paper of Kunz [Ku2]. I had already 
needed to verify that the Hecke rings were Gorenstein in order to compute the 
= congruences developed in Chapter 2. This property had first been proved by 
Mazur in the case of prime level and his argument had already been extended 
by other authors as the need arose. Kunz’s paper suggested the use of an 
invariant (the 7-invariant of the appendix) which I saw could be used to test 
for isomorphisms between Gorenstein rings. A different invariant (the p/p?- 
invariant of the appendix) I had already observed could be used to test for 
isomorphisms between complete intersections. It was only on reading Section 6 
of [Ti2] that I learned that it followed from Tate’s account of Grothendieck 
duality theory for complete intersections that these two invariants were equal 
for such rings. Not long afterwards I realized that, unlikely though it seemed at 
first, the equality of these invariants was actually a criterion for a Gorenstein 
ring to be a complete intersection. These arguments are given in the appendix. 

The impact of this result on the main problem was enormous. Firstly, the 
relationship between the Hecke rings and the deformation rings could be tested 
just using these two invariants. In particular I could provide the inductive ar- 
gument of Section 3 of Chapter 2 to show that if all liftings with restricted 
ramification are modular then all liftings are modular. This I had been trying 
to do for a long time but without success until the breakthrough in commuta- 
tive algebra. Secondly, by means of a calculation of Hida summarized in [Hi2] 
the main problem could be transformed into a problem about class numbers 
of a type well-known in Iwasawa theory. In particular, I could check this in 
the ordinary CM case using the recent theorems of Rubin and Kolyvagin. This 
is the content of Chapter 4. Thirdly, it meant that for the first time it could 
be verified that infinitely many j-invariants were modular. Finally, it meant 
that I could focus on the minimal level where the estimates given by my earlier 
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Galois cohomology calculations looked more promising. Here I was also using 
the work of Ribet and others on Serre’s conjecture (the same work of Ribet 
that had linked Fermat’s Last Theorem to modular forms in the first place) to 
know that there was a minimal level. | 

The class number problem was of a type well-known in Iwasawa theory 
and in the ordinary case had already been conjectured by Coates and Schmidt. 
However, the traditional methods of Iwasawa theory did not seem quite suf- 
ficient in this case and, as explained earlier, when translated into the ring- 
theoretic language seemed to require unknown principles of base change. So 
instead I developed further the idea of using auxiliary primes to replace the. 
change of field that is used in Iwasawa theory. The Galois cohomology esti- 
mates described in Chapter 3 were now much stronger, although at that time 
I was still using primes q = —1modp for the argument. The main difficulty 
was that although I knew how the 7-invariant changed as one passed to an 
auxiliary level from the results of Chapter 2, I did not know how to estimate 
the change in the p/p?-invariant precisely. However, the method did give the 
right bound for the generalised class group, or Selmer group as it is often called 
in this context, under the additional assumption that the minimal Hecke ring 
was a complete intersection. 

I had earlier realized that ideally what I needed in this method of auxiliary 
primes was a replacement for the power series ring construction one obtains in 
the more natural approach based on Iwasawa theory. In this more usual setting, 
the projective limit of the Hecke rings for the varying fields in a cyclotomic 
tower would be expected to be a power series ring, at least if one assumed 
the vanishing of the p-invariant. However, in the setting with auxiliary primes 
where one would change the level but not the field, the natural limiting process 
did not appear to be helpful, with the exception of the closely related and very 
important construction of Hida [Hil]. This method of Hida often gave one step 
towards a power series ring in the ordinary case. There were also tenuous hints 
of a patching argument in Iwasawa theory ({Scho], [Wi4, §10]), but I searched 
without success for the key. 

Then, in August, 1991, I learned of a new construction of Flach [Fl] and 
quickly became convinced that an extension of his method was more plausi- 
ble. Flach’s approach seemed to be the first step towards the construction of 
an Euler system, an approach which would give the precise upper bound for 
the size of the Selmer group if it could be completed. By the fall of 1992, I 
believed I had achieved this and began then to consider the remaining case 
where the mod 3 representation was assumed reducible. For several months I 
tried.simply to repeat the methods using deformation rings and Hecke rings. 
Then unexpectedly in May 1993, on reading of a construction of twisted forms 
of modular curves in a paper of Mazur [Ma3], I made a crucial and surprising 
breakthrough: I found the argument using families of elliptic curves with a 
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common ps5 which is given in Chapter 5. Believing now that the proof was 
complete, I sketched the whole theory in three lectures in Cambridge, England 
on June 21-23. However, it became clear to me in the fall of 1993 that the con- 
struction of the Euler system used to extend Flach’s method was incomplete 
and possibly flawed. 

Chapter 3 follows the original approach I had taken to the problem of 
bounding the Selmer group but had abandoned on learning of Flach’s paper. 
Darmon encouraged me in February, 1994, to explain the reduction to the com- 
plete intersection property, as it gave a quick way to exhibit infinite families 
of modular j-invariants. In presenting it in a lecture at Princeton, I made, 
almost-unconsciously, a critical switch to the special primes used in Chapter 3 
as auxiliary primes. I had only observed the existence and importance of these 
primes in the fall of 1992 while trying to extend Flach’s work. Previously, I had 
only used primes q = —1 mod p as auxiliary primes. In hindsight this change 
was crucial because of a development due to de Shalit. As explained before, I 
had realized earlier that Hida’s theory often provided one step towards a power 
series ring at least in the ordinary case. At the Cambridge conference de Shalit 
had explained to me that for primes q = 1 mod p he had obtained a version of 
Hida’s results. But except for explaining the complete intersection argument 
in the lecture at Princeton, I still did not give any thought to my initial ap- 
proach, which I had put aside since the summer of 1991, since I continued to 
believe that the Euler system approach was the correct one. 

Meanwhile in January, 1994, R. Taylor had joined me in the attempt to 
repair the Euler system argument. Then in the spring of 1994, frustrated in 
the efforts to repair the Euler system argument, I began to work with Taylor 
on an attempt to devise a new argument using p = 2. The attempt to use p = 2 
reached an impasse at the end of August. As Taylor was still not convinced that 
the Euler system argument was irreparable, I decided in September to take one 
last look at my attempt to generalise Flach, if only to formulate more precisely _ 
the obstruction. In doing this I came suddenly to a marvelous revelation: I 
saw in a flash on September 19th, 1994, that de Shalit’s theory, if generalised, 
could be used together with duality to glue the Hecke rings at suitable auxiliary 
levels into a power series ring. I had unexpectedly found the missing key to my 
old abandoned approach. It was the old idea of picking q;’s with q; = 1 mod p™ 
and n; — oo as i — œo that I used to achieve the limiting process. The switch 
to the special primes of Chapter 3 had made all this possible. 

After I communicated the argument to Taylor, we spent the next few days 
making sure of the details. The full argument, together with the deduction of 
the complete intersection property, is given in [TW]. 

In conclusion the key breakthrough in the proof had been the realization 
in the spring of 1991 that the two invariants introduced in the appendix could 
be used to relate the deformation rings and the Hecke rings. In effect the n- 
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invariant could be used to count Galois representations. The last step after the 
June, 1993, announcement, though elusive, was but the conclusion of a long 
process whose purpose was to replace, in the ring-theoretic setting, the methods 
based on Iwasawa theory by methods based on the use of auxiliary primes. 

One improvement that I have not included but which might be used to 
simplify some of Chapter 2 is the observation of Lenstra that the criterion for 
Gorenstein rings to be complete intersections can be extended to more general 
rings which are finite and free as Zy-modules. Faltings has pointed out an 
improvement, also not included, which simplifies the argument in Chapter 3 
and [TW]. This is however explained in the appendix to [TW]. 

It is a pleasure to thank those who read carefully a first draft of some of this 
paper after the Cambridge conference and particularly N. Katz who patiently 
answered many questions in the course of my work on Euler systems, and 
together with Illusie read critically the Euler system argument. Their questions 
led to my discovery of the problem with it. Katz also listened critically to my 
first attempts to correct it in the fall of 1993. I am grateful also to Taylor for 
his assistance in analyzing in depth the Euler system argument. I am indebted 
to F. Diamond for his generous assistance in the preparation of the final version 
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Chapter 1 


This chapter is devoted to the study of certain Galois representations. 
In the first section we introduce and study Mazur’s deformation theory and 
discuss various refinements of it. These refinements will be needed later to 
make precise the correspondence between the universal deformation rings and 
the Hecke rings in Chapter 2. The main results needed are Proposition 1.2 
which is used to interpret various generalized cotangent spaces as Selmer groups 
and (1.7) which later will be used to study them. At the end of the section we 
relate these Selmer groups to ones used in the Bloch-Kato conjecture, but this 
connection is not needed for the proofs of our main results. 

In the second section we extract from the results of Poitou and Tate on 
Galois cohomology certain general relations between Selmer groups as » varies, 
as well as between Selmer groups and their duals. The most important obser- 
vation of the third section is Lemma 1.10(i) which guarantees the existence of 
the special primes used in Chapter 3 and [TW]. 


1. Deformations of Galois representations 


Let p be an odd prime. Let © be a finite set of primes including p and 
let Qs be the maximal extension of Q unramified outside this set and oo. 
Throughout we fix an embedding of Q, and so also of Qg, in C. We will also 
fix a choice of decomposition group Dy, for all primes g in Z. Suppose that k 
is a finite field of characteristic p and that 


(1.1) po: Gal(Qs/Q) > GL2(k) 


is an irreducible representation. In contrast to the introduction we will assume 
in the rest of the paper that pọ comes with its field of definition k. Suppose 
further that det pọ is odd. In particular this implies that the smallest field of 
definition for po is given by the field ko generated by the traces but we will not 
assume that k = ko. It also implies that po is absolutely irreducible. We con- 
sider the deformations [p] to GL2(A) of po in the sense of Mazur [Ma1]. Thus 
if W(k) is the ring of Witt vectors of k, A is to be a complete Noetherian local 
W (k)-algebra with residue field k and maximal ideal m, and a deformation [p] 
is just a strict equivalence class of homomorphisms p: Gal(Qy/Q) — GLo2(A) 
such that p mod m = po, two such homomorphisms being called strictly equiv- 
alent if one can be brought to the other by conjugation by an element of 
ker : GL2(A) — GlLe(k). We often simply write p instead of [p] for the 
equivalence class. 
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We will restrict our choice of po further by assuming that either: 


(i) po is ordinary; viz., the restriction of po to the decomposition group D, 
has (for a suitable choice of basis) the form 


| X1 * 
1.2 N 
( ) i Dp ( 0 x2 
where xı and x2 are homomorphisms from D, to k* with x2 unramified. 
Moreover we require that xı # x2. We do allow here that po|p, be 


semisimple. (If xı and x2 are both unramified and po|p, is semisimple 
then we fix our choices of xı and x2 once and for all.) 


(ii) po is flat at p but not ordinary (cf. [Sel] where the terminology finite is 
used); viz., o|p, is the representation associated to a finite flat group 
scheme over Z, but is not ordinary in the sense of (i). (In general when we 
refer to the flat case we will mean that pp is assumed not to be ordinary 
unless we specify otherwise.) We will assume also that det po|z, = w 
where J, is an inertia group at p and w is the Teichmüller character 


giving the action on pt? roots of unity. 


In case (ii) it follows from results of Raynaud that po|p, is absolutely 
irreducible and one can describe po|z, explicitly. For extending a Jordan-H6lder 
series for the representation space (as an J,-module) to one for finite flat group 
schemes (cf. [Ray1]) we observe first that the trivial character does not occur on 
a subquotient, as otherwise (using the classification of Oort-Tate or Raynaud) 
the group scheme would be ordinary. So we find by Raynaud’s results, that 
po\ Tp S k ~ yı ® y2 where yı and we are the two fundamental characters of 


degree 2 (cf. Corollary 3.4.4 of [Ray1]). Since %ı and %2 do not extend to 
characters of Gal(Q,/Q,), po|p, must be absolutely irreducible. | 

We will sometimes wish to make one of the following restrictions on the 
deformations we allow: 


(i) (a) Selmer deformations. In this case we assume that po is ordinary, with no- 
tation as above, and that the deformation has a representative 
p : Gal(Qs/Q) — GL2(A) with the property that (for a suitable choice 


of basis) 
— (X * 
Plos ~ ( 0 4 


with X2 unramified, X2 = x2 mod m, and det p| lL = ew™tyixz2 where 

€ is the cyclotomic character, e: Gal(Qz/Q) — Z% , giving the action 
on all p-power roots of unity, w is of order prime to p satisfying w = € 
mod p, and xı and x2 are the characters of (i) viewed as taking values in 
k* <> A*. 


MODULAR ELLIPTIC CURVES AND FERMAT’S LAST THEOREM 457 


(i) (b) Ordinary deformations. The same as in (i)(a) but with no condition on 
the determinant. 


(i) (c) Strict deformations. This is a variant on (i) (a) which we only use when 
polp, is not semisimple and not flat (i.e. not associated to a finite flat 
group scheme). We also assume that x1x> 1 — w in this case. Then a 
strict deformation is as in (i)(a) except that we assume in addition that 


(X1/X2)|D, = €. 


(ii 


ee” 


Flat (at p) deformations. We assume that each deformation p to GL2(A) 
has the property that for any quotient A/a of finite order p|p, mod a 
is the Galois representation associated to the Q,-points of a finite flat 
group scheme over Zp. 


In each of these four cases, as well as in the unrestricted case (in which we 
impose no local restriction at p) one can verify that Mazur’s use of Schlessinger’s 
criteria [Sch] proves the existence of a universal deformation 


p: Gal(Qs/Q) —> GL2( R). 


In the ordinary and unrestricted case this was proved by Mazur and in the 
flat case by Ramakrishna [Ram]. The other cases require minor modifications 
of Mazur’s argument. We denote the universal ring Ry in the unrestricted 
case and R$, Re¢, R", RE in the other four cases. We often omit the X if the 
context makes it clear. 

There are certain generalizations to all of the above which we will also 
need. The first is that instead of considering W (k) -algebras A we may consider 
O-algebras for O the ring of integers of any local field with residue field k. If. 
we need to record which © we are using we will write Ryo etc. It is easy to 
see that the natural local map of local O-algebras | 

Ryo > Ry & O 
W(k) 
is an isomorphism because for functorial reasons the map has a natural section 
which induces an isomorphism on Zariski tangent spaces at closed points, and 
one can then use Nakayama’s lemma. Note, however, that if we change the 
residue field via i: k— k’ then we have a new deformation problem associated _ 
to the representation pj = io po. There is again a natural map of W(k’)- 
algebras | 
/ / 
R) > B® Wik) 
which is an isomorphism on Zariski tangent spaces. One can check that this 
is again an isomorphism by considering the subring Rı of R(pp) defined as the 
subring of all elements whose reduction modulo the maximal ideal lies in k. 
Since R(p}) is a finite R;-module, R; is also a complete local Noetherian ring 
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with residue field k. The universal representation associated to pp is defined 

over R, and the universal property of R then defines a map R — Rı. So we 

obtain a section to the map R(ph) > R & W(k’) and the map is therefore 
W(k) 


an isomorphism. (I am grateful to Faltings for this observation.) We will also 
need to extend the consideration of O-algebras to the restricted cases. In each 
case we can require A to be an O-algebra and again it is easy to see that 
Rsa ~ Ra & O in each case. | 

z,O D Wik) 


The second generalization concerns primes q # p which are ramified in po. 
We distinguish three special cases (types (A) and (C) need not be disjoint): 


(A) polp, = (% =) for a suitable choice of basis, with x; and x2 unramified, 


X1 Xz = w and the fixed space of J, of dimension 1, 


(B) ee l = 6 % xg # 1, for a suitable choice of basis, 


(C) H1(Q,, Wx) = 0 where Wh, is as defined in (1.6). 


Then in each case we can define a suitable deformation theory by imposing 
additional restrictions on those we have already considered, namely: 


p2 


(A) plp, = (* Ž) for a suitable choice of basis of A? with yı and %2 un- 
| ramified and ~~ 1 = g; | 


(B) plz, = (5° °) for a suitable choice of basis (xq of order prime to p, so the 


same character as above); 
(C) det plz, = det polz,, i.e., of order prime to p. 


Thus if M is a set of primes in È distinct from p and each satisfying one of 
(A), (B) or (C) for po, we will impose the corresponding restriction at each 
prime in M. 

Thus to each set of data D = {-,&,O,M} where - is Se, str, ord, flat or 
unrestricted, we can associate a deformation theory to po provided 


(1.3) po: Gal(Qs/Q) > GLe(k) 


is itself of type D and O is the ring of integers of a totally ramified extension 
of W(k); po is ordinary if - is Se or ord, strict if - is strict and flat if - is fl 
(meaning flat); po is of type M, i.e., of type (A), (B) or (C) at each ramified 
prime q Æ p,q E M. We allow different types at different q’s. We will refer 
to these as the standard deformation theories and write Rp for the universal 
ring associated to D and pp for the universal deformation (or even p if D is 
clear from the context). 

We note here that if D = (ord, £, O, M) and D’ = (Se, £, O, M) then 
there is a simple relation between Rp and Rp. Indeed there is a natural map 
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Rp — Rp by the universal property of Rp, and its kernel is a principal ideal 
generated by T = e~t (y) det pp(7) — 1 where y € Gal(Qs/Q) is any element 
whose restriction to Gal(Q../Q) is a generator (where Qa is the Zp-extension | 
of Q) and whose restriction to Gal(Q(Cnp)/Q) is trivial for any N prime to p 
with Cy E Qs, Cn being a primitive N* root of 1: 


(1.4) Rp/T ~ Rp. 3 


It turns out that under the hypothesis that po is strict, i.e. that po|D, 
is not associated to a finite flat group scheme, the deformation problems in 
(i)(a) and (i)(c) are the same; i.e., every Selmer deformation is already a strict 
deformation. This was observed by Diamond. The argument is local, so the 
decomposition group Dp could be replaced by Gal(Q,/Q,). 


PROPOSITION 1.1 (Diamond). Suppose that 7:Dp) — GLe(A) is a con- 
tinuous representation where A is an Artinian local ring with residue field k, a 
finite field of characteristic p. Suppose 7 % lo ee) with xı and xz unramified © 
and xı # X2. Then the residual representation 7 is associated to a finite flat 
group scheme over Zp. 


Proof (taken from [Dia, Prop. 6.1]). We may replace 7 by t @ x3" and 
we let p = x;Xz_- Then m & ($° t) determines a cocycle t : Dp —> M(1) where 
M is a free A-module of rank one on which D, acts via y. Let u denote the 
cohomology class in H!(D,, M(1)) defined by t, and let uo denote its image 
-` in H'(Dy,Mo(1)) where Mo = M/mM. Let G = ker y and let F be the fixed 
field of G (so F is a finite unramified extension of Qp). Choose n so that p” A 
= 0. Since H?(G, upr) —> H*(G, ups) is injective for r < s, we see that the 
natural map of A[Dy / G]-modules H! (G, upr) Z, M > H'(G, M(1)) is an- 
isomorphism. By Kummer theory, we have H!(G, M(1)) S F*/(F*)?" Z, M 
as Dp-modules. Now consider the commutative diagram 


H! (G, M(1))2»—~((F*/(F*)"" 8z; M)2> ——+ MP» 


H! (G, Mo(1)) —— (F*/(F*)?) @p, Mo ——> Mo 


where the right-hand horizontal maps are induced by vp : F* — Z. If p £1, 
then M”? C mM, so that the element res uo of H'(G, Mo(1)) is in the image 
of (OF /(O%)?) 9p, Mo. But this means that 7 is “peu ramifié” in the sense of 
[Se] and therefore 7 comes from a finite flat group scheme. (See [E1, (8.2)].) 


Remark. Diamond also observes that essentially the same proof shows 
that if m : Gal(Q,/Q,) — GL2(A), where A is a complete local Noetherian 
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ring with residue field k, has the form 7|z, S (5 7) with 7 ramified then z is 
of type (A). 


Globally, Proposition 1.1 says that if po is strict and if D = (Se, £, O, M) 
and D’ = (str, X, '©, M) then the natural map Rp — Ry is an isomorphism. 

In each case the tangent space of Rp may be computed as in [Mal]. Let 
A be a uniformizer for O and let U, ~ k? be the representation space for po. 
(The motivation for the subscript À will become apparent later.) Let Vy be the 
representation space of Gal(Qs;/Q) on Ad pp = Hom, (Uy, Ux) ~ Mo(k). Then 
there is an isomorphism of k-vector spaces (cf. the proof of Prop. 1.2 below) 


(1.5) Hom,(mp/(m3, A), k) ~ H4(Qs/Q,V) 


where Hi(Qs/Q, Vy) is a subspace of H1(Qs/Q, Va) which we now describe 
and mp is the maximal ideal of Rp. It consists of the cohomology classes 
which satisfy certain local restrictions at p and at the primes in M. We call 
mp/(m4,, A) the reduced cotangent space of Rp. 

We begin with p. First we may write (since p # 2), as k[Gal(Qs/Q)]- 
modules, 


(1.6) V = Wy) ®k, where Wy 


{f € Hom;(Uy, Uy): trace f = 0} 
(Sym? & det~1)p 


[2 


and k is the one-dimensional subspace of scalar multiplications. Then if po 
is ordinary the action of Dp on U, induces a filtration of U, and also on W, 
and Vy. Suppose we write these 0 C UY CU), 0C WY € Wy. C W, and 
OC VY C Vj CV). Thus U . is defined by the requirement that D, act on it 
via the character xı (cf. (1.2)) and on U)/U} via x2. For W, the filtrations 
are defined by 


Wy {f € Wx: f(UY) c US}, 
W? = {few}: f =0 on U3}, 


and the filtrations for V) are obtained by replacing W by V. We note that 
these filtrations are often characterized by the action of D,. Thus the action 
of Dp on Wy is via x1/x2; on Wi/WY it is trivial and on W/W} it is via 
x2/x1- These determine the filtration if either x1/x2 is not quadratic or po| Dp 
is not semisimple. We define the k-vector spaces 
Vet = {feV}:f=0 in Hom(U,/U8,0)/U%)}, 

Hg_(Qp, Va) ker{H' (Qp, Va) > H'(Qp™, Va/Wy)}, 

H}a (Qp, Vy) ker{H*(Qp, Vy) = H! (Q, Vy/Vx"*)}, 

H;.. (Qp, Vy) ker{H*(Qp, Vy) =e H+ (Qp, Wa /WX) © H (Q, k) }. 


Ii 
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In the Selmer case we make an analogous definition for H4 (Qp, Wy) by 
replacing Vy by Wy, and similarly in the strict case. In the flat case we use 
the fact that there is a natural isomorphism of k-vector spaces 


H' (Qp, Va) > Extiyp,j(Ua, Uy) 


where the extensions are computed in the category of k-vector spaces with local 
Galois action. Then H}(Qp, Vy) is defined as the k-subspace of H 1(Qp, Vy) 
which is the inverse image of Ext4(G,G), the group of extensions in the cate- 
gory of finite flat commutative group schemes over Zp killed by p, G being the 
(unique) finite flat group scheme over Zp associated to U). By [Ray1] all such 
extensions in the inverse image even correspond to k-vector space schemes. For 
more details and calculations see [Ram]. 
For q different from p and q € M we have three cases (A), (B), (C). I 

case (A) there is a filtration by Dg, entirely analogous to the one for p. We 
write this 0 C wy CW, uae W) and we set 


ker : H! (Q4, Vy) 


— H!(Qa, W/W,") © H! (Q2, k) in case (A) 
HD, (Qq, Va) = 
ker : H! (Qq, Vy) 
> HQ, Vy) in case (B) or (C). 


Again we make an analogous definition for H} D, (Qg, Wx) by replacing Vy 
by W) and deleting the last term in case (A). We now define the k-vector 


Space Hp (Qs /Q, Vy) as 
Hp(Qs:/Q, Va) = {a € H'(Qs/Q, Va): aq € Hh, (Qq,Vy) for all q €M, 
ap € H; (Qp, Vy)} 


where * is Se, str, ord, fl or unrestricted according to the type of D. A similar 
definition applies to H4(Qs/Q, W3) if - is Selmer or strict. 

Now and for the rest of the section we are going to assume that pọ arises 
from the reduction of the à-adic representation associated to an eigenform. 
More precisely we assume that there is a normalized eigenform f of weight 2 
and level N, divisible only by the primes in X, and that there is a prime A 
of Of such that pọ = pf, mod À. Here Oy is the ring of integers of the field 
generated by the Fourier coefficients of f so the fields of definition of the two 
representations need not be the same. However we assume that k 2 Of A/A 
and we fix such an embedding so the comparison can be made over k. It will 
be convenient moreover to assume that if we are considering po as being of 
type D then D is defined using O-algebras where © 2 O;, is an unramified 
extension whose residue field is k. (Although this condition is unnecessary, it 
is convenient to use À as the uniformizer for O.) Finally we assume that pf 
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itself is of type D. Again this is a slight abuse of terminology as we are really 
considering the extension of scalars ps o O and not pp itself, but we will 


do this without further mention if the Beat makes it clear. (The analysis of 
this section actually applies to any characteristic zero lifting of pọ but in all 
our applications we will be in the more restrictive context we have described 
here. ) | 

With these hypotheses there is a unique local homomorphism Rp — O 
of O-algebras which takes the universal deformation to (the class of) pz. Let 
pp = ker: Rp — O. Let K be the field of fractions of O and let U; = (K/O)* 
with the Galois action taken from py. Similarly, let Vf = Ad pf à 80 K/O = 
oe /O)* with the adjoint representation so that 

V; ~ We K/O 

where Wy has Galois action via Sym? Pf, 8 det PF) , and the action on the 
second factor is trivial. Then if pọ is ordinary the filtration of Uy under the 
Ad p action of Dp induces one on Wy which we write 0 C Ww? C W; C Wy. 
Often to simplify the notation we will drop the index f from W;, V+ etc. There 
is also a filtration on Wyn = {ker à”: Wz —> Wy} given by Win = Wyn NW? 
(compatible with our previous description for n = 1). Likewise we write V)n 
for {ker A”: Vf — Vy}. 

We now explain how to extend the definition of H4 to give meaning to 
H} (Qz/Q, Van) and H} (Qs/Q, V) and these are O/A” and O-modules, re- 
spectively. In the case where po is ordinary the definitions are the same with 
Vyn or V replacing Vy and O/A” or K/O replacing k. One checks easily that 
as O-modules | 
(1.7) Hp(Qz/Q, Vin) ~ Hp(Qz/Q,V) a 
where as usual the subscript A” denotes the kernel of multiplication by A”. 
This just uses the divisibility of H°(Qy/Q,V) and H°(Q,, W/W?) in the 
strict case. In the Selmer case one checks that for m > n the kernel of 

HEQE", Van/Wiu) > HHQH", Vym (Wir) 

has only the zero element fixed under Gal(Q}™ /Qp) and the ord case is similar. 
Checking conditions at q E M is done with similar arguments. In the Selmer 
and strict cases we make analogous definitions with W)n in place of Vyn and 
W in place of V and the analogue of (1.7) still holds. 

We now consider the case where po is flat (but not ordinary). We claim 
first that there is a natural map of O-modules 


(1.8) H*(Qp, Van) > Exton, (Ux, Uy») 
for each m > n where the extensions are of O-modules with local Galois 


action. To describe this suppose that œa EH 1 (Qp, Vyn). Then we can asso- 
ciate to a a representation pa: Gal(Q,/Q,) — GLe2(On|e]) (where O,[e] = 
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Ole] /(A"e, e?)) which is an O-algebra deformation of po (see the proof of Propo- 
sition 1.1 below). Let E = Ople]? where the Galois action is via pg. Then 
there is an exact sequence 


O— ¢e€F/X\" —E/X” — (E/e)/X” — 0 
È |? 
l Uyn Uym 

and hence an extension class in Ext! (Uym, Un). One checks now that (1.8) 
is a map of O-modules. We define H} (Qp, Van) to be the inverse image of 
Ext (Un, Uy») under (1.8), i.e., those extensions which are already extensions 
in the category of finite flat group schemes Zp. Observe that Ext (Uyn, Uyn) N 
Extoip,)(U yn, Uyn) is an O-module, so H}(Q,, Van) is seen to be an O-sub- 
module of H 1(Q,, Vyn). We observe that our definition is equivalent to requir- 
ing that the classes in H} (Qp, Van) map under (1.8) to Ext@(U)m, Uyn) for all 
m > n. For if em is the extension class in Ext! (U. ym, Uyn) then €m > en BUym 
as Galois-modules and we can apply results of [Ray1] to see that em comes 
from a finite flat group scheme over Zp if en does. 

In the flat (non-ordinary) case po|r, is determined by Raynaud’s results as 


mentioned at the beginning of the chapter. It follows in particular that, since 
po|,, is absolutely irreducible, V (Qp) = H°(Qp, V) is divisible in this case 
P 


(in fact V (Qp) ~ K/O). Thus H! (Qp, Van) ~ H! (Qp, V)an and hence we can 
define 


H; (Qp, V) = g H? (Qp, Vyn), 
n=1 et 


and we claim that H} (Qp, V)an ~ Hi (Qp, Vy). To see this we have to compare 
representations for m > n, 7 


pram: Gal(Qp/Qp) — GLo(Onle]/") 


| |» 


Pm,m: Gal(Qp/Qp) —-GL2(Om le]/") 


where pnm and pmm are obtained from a, € H'(Q,, Van) and im(an) € 
H! (Qp, Vym) and Ymn: atbe — a+X™"be. By [Ram, Prop 1.1 and Lemma 
2.1] if Pn,m comes from a finite flat group scheme then so does pm,m. Conversely 
Ym,n İs injective and so Pn m comes from a finite flat group scheme if Pm,m does; 
cf. [Ray1]. The definitions of H4 (Qs/Q, Vyn) and H}(Qs,/Q, V) now extend 
to the flat case and we note that (1.7) is also valid in the flat case. p 

Still in the flat (non-ordinary) case we can again use the determination 
of polr, to see that H'(Q,, V) is divisible. For it is enough to check that 
H?(Q,, Vy) = 0 and this follows by duality from the fact that H°(Q,, V,*) =0 
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where V = Hom(Vj, wz) and py, is the group of pt? roots of unity. (Again 
this follows from the explicit form of po| p +) Much subtler is the fact that 
P 


H} (Qp, V) is divisible. This result is essentially due to Ramakrishna. For, 
using a local version of Proposition 1.1 below we have that 


Homo (pr/PR K/O) ~ Hj (Qp, V) 


where R is the universal local flat deformation ring for po|p, and O-algebras. 

(This exists by Theorem 1.1 of [Ram] because po|p, is absolutely irreducible.) 

Since R ~ RË @ O where Rf is the corresponding ring for W (k)-algebras 
W(k l 


the main theorem of [Ram, Th. 4.2] shows that R is a power series ring and 
the divisibility of H} (Qp, V) then follows. We refer to [Ram] for more details 
about Rf. 

Next we need an analogue of (1.5) for V. Again this is a variant of standard 
results in deformation theory and is given (at least for D = (ord, £, W (k), $) 
with some restriction on x1, X2 in i(a)) in [MT, Prop 25]. 


PROPOSITION 1.2. Suppose that pf is a deformation of po of type 
D=(-,0,0,M) with O an unramified extension of Ofa. Then as O-modules 


Homo(pp/pp,K/O) ~ Hp(Qs/Q,V). 


Remark. The isomorphism is functorial in an obvious way if one changes 
D to a larger D’. 


Proof. We will just describe the Selmer case with M = ¢ as the other 
cases use similar arguments. Suppose that & is a cocycle which represents a 
cohomology class in Hf,(Qs:/Q, Van). Let O,[e] denote the ring Ofe]/(A"e, €?). 
We can associate to a a representation 


Pa: Gal(Qs/Q) =} GL2(O,,[e]) 


as follows: set pa(g) = a(g)pz,,(g) where pf a(g), a priori in GL2(O), is viewed 
in GL2(O,[e]) via the natural mapping O — O,J/e]. Here a basis for O? 
is chosen so that the representation py, on the decomposition group Dp C 
Gal(Qs;/Q) has the upper triangular form of (i)(a), and then a(g) € Vyn is 
viewed in GL2(O,,[e]) by identifying | 


ZE 1 — te 


Vyn ~ r mee: Ae )} = {ker : GL2(O,{e]) —> GL2(0)}. 


Then 
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i4 YE TE 
l aus 
_ l+ye xe 7 


1 1+ye ze 
vis =4( 1 — te e 


One checks readily that pa is a continuous homomorphism and that the defor- 
mation [pa] is unchanged if we add a coboundary to a. 

We need to check that [pa] is a Selmer deformation. Let H = 
Gal(Q,/Qu"") and G = Gal(Q5™/Q,). Consider the exact sequence of O[G]- 
modules | : | 

0 > (Vyn/Win)™ > (Van/Wyn) > X 0 


and 


where X is a submodule of (V\n/V,n)’*. Since the action of Dy on Vyn /V}, is 
via a character which is nontrivial mod À (it equals X2X1 | mod à and xı # x2), 
we see that X¥ = 0 and H!(G,X) = 0. Then we have an exact diagram of 


O-modules j 


| 


H? (G, (Vyn/Wyn)’*) =~ H*(G, (Vyn/Wyn)”) 


| 


H*(Qp, Vyn /Wyn) 


| 


H¥(Q3™, Vin /WEn)9. 


By hypothesis the image of a is zero in H}( Qo, Vyn/ W»,)%. Hence it 
is in the image of H'(G,(V\n/W%,)”). Thus we can assume that it is rep- 
resented in H'(Qp, Van/ WY.) by a cocycle, which maps G to Vin /W nr; i.e., 
f(Dp) C Vh/W%, f(Ip) = 0. The difference between f and the image of a is 
a coboundary {0 ++ ot—wu} for some u € Vyn. By subtracting the coboundary 
{o +> ou — u} from a globally we get a new a such that a = f as cocycles 
mapping G to Vin/Wy,n. Thus a(D,) C Vh, a(Ip) C W®, and it is now easy 
to check that [pa] is a Selmer deformation of po. | 

Since [pa] is a Selmer deformation there is a unique map of local ØO- 
algebras ya : Rp — O,/[e] inducing it. (If M Æ ¢ we must check the 
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other conditions also.) Since pa =p f, mod e we see that restricting Ya to pp 
gives a homomorphism of O-modules, 


Ya: pp > €.0/A" 
such that yo(p3,) = 0. Thus we have defined a map y:a — ga, 


P : H$.(Qz/Q, Van) = Homo(pp/pp, O/X"). 


It is straightforward to check that this is a map of O-modules. To check the 

injectivity of y suppose that ya(pp) = 0. Then pa factors through Rp/pp ~ O 

and being an O-algebra homomorphism this determines Ya. Thus [pfa] = [Pa]. 

If A-tp,A = pf then Amode is seen to be central by Schur’s lemma and so 

may be taken to be J. A simple calculation now shows that a is a coboundary. 
To see that ọ is surjective choose 


Y € Homo(pp/pp, O/A"). 


Then py: Gal(Qg/Q) — GLo(Rp/(p3, ker Y)) is induced by a representative 
of the universal deformation (chosen to equal pf, when reduced mod pp) and 
we define a map aw : Gal(Qz/Q) — Vyn by 


1+pp/(p3,,ker V) pp/(p5,ker Y) 
ay(g) = pylg)pf alg) € i C Vyn 
pp/(p2,,ker ¥)  1+pp/(ph, ker Y) 


where pf (g) is viewed in GLo(Rp/(p%, ker Y)) via the structural map O > 
Rp (Rp being an O-algebra and the structural map being local because of 
the existence of a section). The right-hand inclusion comes from 


pp/(p,, ker V) > O/M@ 5 (O/r)-€ 
l1 & oe 


- Then ay is readily seen to be a continuous cocycle whose cohomology class 
lies in Hå (Qgz/Q, Vyn). Finally p(ay) = Y. Moreover, the constructions are 
compatible with change of n, i.e., for Vyn > Vyn+ı and A: O/A7 SG O/A". O 


We now relate the local cohomology groups we have defined to the theory 
of Fontaine and in particular to the groups of Bloch-Kato [BK]. We will dis- 
tinguish these by writing H$ for the cohomology groups of Bloch-Kato. None 
of the results described in the rest of this section are used in the rest of the 
paper. They serve only to relate the Selmer groups we have defined (and later 
compute) to the more standard versions. Using the lattice associated to pf, we 
obtain also a lattice T ~ O* with Galois action via Ad py. Let V = T Z, Qp 
be the associated vector space and identify V with V/T. Let pr: V — V be 
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the natural projection and define cohomology modules by 


H$ (Qp, Y) = ker: H+ (Qp, Y) EE H+! (Qp, cer Barys), 


H}(Qp, V) = pr (H}(Qp, v)) C H! (Qp, V), 
H(Qp, Van) = (In)! (H(Qp,V)) C H1(Qp, Van), 


where jn: Vyn — V is the natural map and the two groups in the definition 
of H}(Q,, V) are defined using continuous cochains. Similar definitions apply 
to V* = Homa, (VY, Q,(1)) and indeed to any finite-dimensional continuous 
p-adic representation space. The reader is cautioned that the definition of 
H}(Qp, Van) is dependent on the lattice T (or equivalently on V). Under 
certain conditions Bloch and Kato show, using the theory of Fontaine and 
Lafaille, that this is independent of the lattice (see [BK, Lemmas 4.4 and 
4.5|). In any case we will consider in what follows a fixed lattice associated to 
P = Pf, Ad p, etc. Henceforth we will only use the notation H}(Q», —) when 
the underlying vector space is crystalline. 


PROPOSITION 1.3. (i) If po is flat but not ordinary and psy is associated 
to a p-divisible group then for all n 


H; (Qp, Van) = Hi (Qp, Van). 
(ii) If pf, is ordinary, det ps. al, = € and pf, is associated to a p-divisible 
p 


group, then for all n, 


Hp(Qp, Van) s Hse (Qp, Van). 


Proof. Beginning with (i), we define H} (Qp, V) = {a € H!(Qp, V) : 
k(a/à”) € H} (Qp, V) for all n} where « : H1(Q,,V) > H! (Qp, V). Then 
we see that in case (i), H} (Qp, V) is divisible. So it is enough to show that 


Hp(Qp, V) = H; (Qp, V). 

We have to compare two constructions associated to a nonzero element a of 
H'(Q,, V). The first is to associate an extension 

(1.9) ee ed ee RG 


of K-vector spaces with commuting continuous Galois action. If we fix an e 
with 6(e) = 1 the action on e is defined by ve = e + &(c) with â a cocycle 
representing a. The second construction begins with the image of the subspace 
(a) in H'(Q,,V). By the analogue of Proposition 1.2 in the local case, there 
is an O-module isomorphism 


H’ (Qp, V) ~ Homo (p/p, K/O) 
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where R is the universal deformation ring of pọ viewed as a representation 
of Gal(Q,/Q,) on O-algebras and pp is the ideal of R corresponding to pp 
(i.e., its inverse image in R). Since a # 0, associated to (a) is a quotient 
pr/(p?,a) of pr/p?, which is a free O-module of rank one. We then obtain a 
homomorphism 


pa: Gal(Qp/Qp) > GLa (R/(pz, a)) 


induced from the universal deformation (we pick a representation in the uni- 
versal class). This is associated to an O-module of rank 4 which tensored with 
K gives a K-vector space E’ ~ (K)* which is an extension 


(1.10) 0-U- EF +u-0 


where U ~ K? has the Galois representation ps (viewed locally). 

In the first construction a € Hj-(Q,, V) if and only if the extension (1.9) is 
crystalline, as the extension given in (1.9) is a sum of copies of the more usual 
extension where Qp replaces K in (1.9). On the other hand (a) C H} (Qp, V) if 
and only if the second construction can be made through Rf, or equivalently if 
and only if F’ is the representation associated to a p-divisible group. (A priori, 
the representation associated to pg only has the property that on all finite 
quotients it comes from a finite flat group scheme. However a theorem of 
Raynaud [Ray1] says that then pa comes from a p-divisible group. For more 
details on Rf, the universal flat deformation ring of the local representation 
po, see [Ram].) Now the extension E’ comes from a p-divisible group if and 
only if it is crystalline; cf. [Fo, 86]. So we have to show that (1.9) is crystalline 
if and only if (1.10) is crystalline. 

One obtains (1.10) from (1.9) as follows. We view V as Homx(U, U) and 
let 


X = ker : {Homg (U, U) 8U — U} 


where the map is the natural one f & w > f(w). (All tensor products in this 
proof will be as K-vector spaces.) Then as K[D,|-modules 


F' ~ (E @U)/X. 


To check this, one calculates explicitly with the definition of the action on E 
(given above on e) and on F’ (given in the proof of Proposition 1.1). It follows 
from standard properties of crystalline representations that if E is crystalline, 
so is E & U and also E’. Conversely, we can recover E from E’ as follows. 
Consider E’ @U ~ (E &U @U)/(X @U). Then there is a natural map 
yp: EQ (det) — E’ @U induced by the direct sum decomposition U @U ~ 
(det) $ Sym? U. Here det denotes a 1-dimensional vector space over K with 
Galois action via det pf. Now we claim that ¢ is injective on V @ (det). For 
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if f € V then y(f) = f 8 (wi @ we — we Q w1) where w1, we are a basis for U 
for which w1Aw2 = 1 in det ~ K. So if y(f) € X QU then 


f(wi) 8 we — f(we) 8 wi =0 in U@U. 


But this is false unless f(wi) = f(w2) = 0 whence f = 0. So y is injective 
on V & det and if ọ itself were not injective then E would split contradicting 
a #0. So is injective and we have exhibited E @ (det) as a subrepresentation 
-of E" @U which is crystalline. We deduce that E is crystalline if E’ is. This 
completes the proof of (i). | 
To prove (ii) we check first that Hå (Qp, Van) = jz! (Ha, (Qo, v)) (this 

was already used in (1.7)). We next have to show that H} (Qp, V) C Hd,(Qp, V) 
where the latter is defined by 


H3.(Qp, V) = ker : H*(Qp, V) + H! (QE, V/V°) 


with V? the subspace of V on which J, acts via e. But this follows from the 
computations in Corollary 3.8.4 of [BK]. Finally we observe that 


pr (H. (Qp, v)) c Hg.(Qp, V) 
although the inclusion may be strict, and 
pr (Hp(Qp; V)) = Hi(Qp, V) 
by definition. This completes the proof. o 


These groups have the property that for s > r, 
(1.11) H? (Qp, Var) N izt ( HHQ, Vae)) = HB(Qp, Var) 


where Jrs : Vyr — Vys is the natural injection. The same holds for Vy, and 
Vy; in place of V)r and Vys where Vý. is defined by 


Výr = Hom(V\r ; hpr) 


and similarly for V$. Both results are immediate from the definition (and 
indeed were part of the motivation for the definition). 

We also give a finite level version of a result of Bloch-Kato which is easily 
deduced from the vector space version. As before let T C V be a Galois stable 
lattice so that T ~ O4. Define 


HAQ, T) =i (HB(Qp, V) 


under the natural inclusion i : T — V, and likewise for the dual lattice T* = 
Homz, (V, (Qp/Zp)(1)) in V*. (Here V* = Hom(V, Q,(1)); throughout this 
paper we use M* to denote a dual of M with a Cartier twist.) Also write 
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pr, : T — T/A” for the natural projection map, and for the mapping it 
induces on cohomology. | | 


PROPOSITION 1.4. If pz is associated to a p-divisible group (the ordi- 
nary case is allowed) then : 


(i) pr, (H}(Q>,T)) = H}(Q,,T/A”") and similarly for T*,T*/X". 


(ii) HE(Qp, Van) is the orthogonal complement of H}(Qp, Vy.) under Tate 
local duality between H! (Qp, Vyn) and H! (Qp, Vx.) and similarly for Wyn 
and Wy, replacing Vyn and Vý. 


More generally these results hold for any crystalline representation V’ in 

place of V and X a uniformizer in K' where K' is any finite extension of Qp 
/ at / 
with K' C Endga(q,/Qp) y. 


Proof. We first observe that pr, (H}(Q,, T)) C Hp(Qp, T/A"). Now 
from the construction we may identify T/A” with Vy)n. A result of Bloch- 
Kato ([BK, Prop. 3.8]) says that H}(Q,, V) and H}(Q,, V*) are orthogonal © 
complements under Tate local duality. It follows formally that H}(Q,, Vš) 
and pr, (HŁ(Qp, T)) are orthogonal complements, so to prove the proposition 
it is enough to show that 


(1.12) # He(Qp, Vin) # He(Qp, Vin) = # H’ (Qp, Van). 
Now if r=dimx H}(Q,, V) and s = dimg H;7(Q,, V*) then 
(1.13) r +s = dim H(Q,, V) + dimp H? (Qp, V*) + dimg V. 


From the definition, 
(1.14)  # Hp(Qp, Var) =# (O/A")" - # ker{H' (Qp, Van) > H' (Qp, V)}. 


The second factor is equal to # {v (Q,)/A" V (Q,)}. When we write V (Qp) # 
for the maximal divisible subgroup of V (Qp) this is the same as 


# (V(Qp)/V(Qp)*”)/A” # (V(Qp)/V(Qp)*”)n 
| | # V(Qp)an/# (V (Qp) E)r. 


Combining this with (1.14) gives 
(1.15) # Hp(Qp, Vae) = # (O/A")" 
Ht H? (Qp, Van) / # (O/*/H OMY), 
This, together with an analogous formula for # Hp(Qp, Vyn) and (1.13), gives 
# Hi(Qp, Van) # HP(Qp, Vin) = # (O/A")*-4# H° (Qp, Van) # H° (Qp, Vin). 
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As #H° (Qp, Vx.) = # H? (Qp, Van) the assertion of (1.12) now follows from 
the formula for the Euler characteristic of Vyn. | 

The proof for W)n, or indeed more generally for any crystalline represen- 
tation, is the same. o 


We also give a characterization of the orthogonal complements of 
H4 (Qp, Wan) and Ht (Qp, Van), under Tate’s local duality. We write these 
duals as Hå» (Qp, Wn) and HZ.«(Qp, Vyn) respectively. Let 


w: H! (Qp, Wyn) nas H! (Qp, Wyn /(Wyn)”) 


be the natural map where (Ws) is the orthogonal complement of Wis in 
Wx,, and let Xn; be defined as the image under the composite map 


hie ate Z% /(Z% 7" 30/3" — H'(Qp, hpn 8 0/2") 
> H'(Qp, W3n/(W3n)”) 


where in the middle term p,n ® O/A" is to be identified with (W%,)1/ ie) 
Similarly if we replace W¥, by Vy, we let Y,,; be the image of Z% / (Z™)P" 
(O/2")? in H1(Qp, Vin /(Wyn)°), and we replace py by the won map pv. 


PROPOSITION 1.5. 


Hgo” (Qp, Win) = Di: (Xni), 
He (Qp, Vin) yy (Yni). 


Proof. This can be checked by dualizing the sequence 


0 > HlulQp, Wan) > Hd.(Qp, Wan) 
— ker: {H! (Qp, Wyn/(Wyn)°) > HHQ, Wyn/(Wan)?}, 


where H4, (Qp, Wyn) = ker : H” (Qp, Wyn) > H (Qp, Wan /(Wan)"). The 
first term is orthogonal to ker : H1(Qy, W3n) + H1 (Qp, Win /(W3n)*). By the 
naturality of the cup product pairing with respect to quotients and subgroups 
the claim then reduces to the well known fact that under the cup product 
pairing 


H! (Qp, [pn ) x H! (Qp, Z/p") — Z/p” 
the orthogonal complement of the unramified homomorphisms is the image of 


the units Z% /(Z% )P" — H*(Qp, Hp). The proof for Vyn is essentially the 
same. | | CO 
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2. Some computations of cohomology groups 


We now make some comparisons of orders of cohomology groups using 
the theorems of Poitou and Tate. We retain the notation and conventions of - 
Section 1 though it will be convenient to state the first two propositions in a 
more general context. Suppose that 


L=[[2,¢ 2 QX) 
qeu 


is a subgroup, where X is a finite module for Gal(Qs/Q) of p-power order. 
We define L* to be the orthogonal complement of L under the perfect pairing 
(local Tate duality) 


I] H! (Q4, X) x I] H*(Q,, X*) mid Qp/Zp 
qed : qeu l 
where X* = Hom(X, Hœ). Let 
Ax : H!(Qs/Q, X) > JT] HQ, X) 
qe 
be the localization map and similarly Ax» for X*. Then we set 
Hi(Qx/Q,X) = Ax (L), Hi (Qs/Q, X*) = Ax (L*). 
The following result was suggested by a result of Greenberg (cf. [Gre1]) and 


is a simple consequence of the theorems of Poitou and Tate. Recall that p is 
always assumed odd and that p E€ È. 


PROPOSITION 1.6. 
#H}(Qs/Q,X) / #H]+(Qz/Q,X*) = hoo | | ha 
qe 


_where 


l hg = #H®(Qy,X*)/[H*(Qq,X):Lq] 
hæ = #H°(R, X*) #H°(Q,X)/#H°(Q,X*). 


Proof. Adapting the exact sequence of Poitou and Tate (cf. [Mi2, Th. 4.20}) 
we get a seven term exact sequence 


0 — Hj(Qz/Q,X) — H'*(Qs/Q,X) — I H (Qa, X)/La 
i q 


I] H? (Q4, X) e H?’ (Qgz/Q, X) Een H}, (Qz/Q, X*)^ 


qeu 


l H? (Qs/Q, X*)^ a 0, 
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where M^ = Hom(M, Qp/Zp). Now using local duality and global Euler char- 
acteristics (cf. [Mi2, Cor. 2.3 and Th. 5.1]) we easily obtain the formula in the 
proposition. We repeat that in the above proposition X can be arbitrary of 
p-power order. O 


We wish to apply the proposition to investigate H}. Let D = (£, 0, M) 
be a standard deformation theory as in Section 1 and define a corresponding 
group Ln = LD,n by setting 


H! (Qa, Var) forq#p and q¢M 
Lang = Hy (Qa; Vyn) forg#p and qeEM 


H.1(Qp, Vin) =‘ for g =p. 


Then Hp(Qs/Q, Van) = Hj, (Qs/Q, Vyn) and we also define 
Hb. (Qz/Q, Vin) = Hi; (Qu/Q, Vin). 


We will adopt the convention implicit in the above that if we consider X’ > X 
then H} (Qyx/Q, Vyn) places no local restriction on the cohomology classes at 
primes q € ©’— X. Thus in H$. (Qs /Q, Vin) we will require (by duality) that 
the cohomology class be locally trivial at q € X’ — Ð. 

We need now some estimates for the local cohomology groups. First we 
consider an arbitrary finite Gal(Qy/Q)-module X: 


| PROPOSITION 1.7. Ifq ¢ X, and X is an arbitrary finite Gal(Qy/Q)- 
module of p-power order, 


#HE (Qeuq/Q,X)/#Hi(Qz/Q,X) < #H"(Qy,X") 
where L, = Ly for LE £E and Li, = H! (Q4, X). 
` Proof. Consider the short exact sequence of inflation-restriction: 


0 > H;(Qs/Q, X) > HE (Qzu/Q, X) > Hom(Gal(Qsug/Qz), X)F®R=/9) 


| 


HHQ, X NCR /Q) + HHQH, X)CQ/Q) 


The proposition follows when we note that 


#H° (Qa, X*) = #H(QU™, X)EARIQ), o 


Now we return to the study of Vyn and Wzn. 
PROPOSITION 1.8. IfqEM (q#p) and X = Vyn then hg = 1. 
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Proof. This is a straightforward calculation. For example if q is of type 
(A) then we have 


Lag = ker{H' (Qq, Var) > H' (Qa, Wan/Wy3n) ® H’ (Q7, O/A")}. 
Using the long exact sequence of cohomology associated to 
0 —> we, > Wyn > W)n/W2n => 0 


one obtains a formula for the order of Ing in terms of #H *(Qg, Won), 
#:H'*(Q,, Wyn/W?,) etc. Using local Euler characteristics these are easily re- 
duced to ones involving H°(Q,, Wx) etc. and the result follows easily. 0 


The calculation of hp is more delicate. We content ourselves with an 
inequality in some cases. 3 


PROPOSITION 1.9. (i) If X = Vyn then 
hp hoo = #(O/d)°" # H° (QpVin)/ # H? (Q, Vš) 


in the unrestricted case. 

(ii) If X = Vyn then 

hp hoo < #(O/A)” # H? (Qp, (Vir")*)/#H(Q,W in) 
in the ordinary case. 
(iti) If X = Vn or Wyn then hp hoo < #H°(Qp, (Wyn)*)/ #H° (Q, Win) 

in the Selmer case. . | 

(iv) If X = Vyn or Wyn then hphoœo =1 in the strict case. 

(v) If X = Vyn then hp hoo = 1 in the flat case. 

(vi) If X = Van or Wyn then hpho = 1/#H°(Q,Vyn) tf Lnp = 
H}(Q,,X) and ps, arises from an ordinary p-divisible group. 

Proof. Case (i) is trivial. Consider then case (iii) with X = V\n. We have 
a long exact sequence of cohomology associated to the exact sequence: 


(1.16) 05> Wh > Vm > Van/Wen > 0. 
In particular this gives the map u in the diagram 


H! (Qp, Vyn) 


1> Z= H (QH /Qy, (Var /W8n)") >H (Qp, Var (WS) > HEQ", Van WS)? 1 


= where G = Gal(Q}™/Qp), H = Gal(Qp/Qp™) and 6 is defined to make the 


triangle commute. Then writing h;(M) for #H‘(Qp, M) we have that #Z = 
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ho(Van/W9„) and #imé > (#imu)/(#Z). A simple calculation using the 
long exact sequence associated to (1.16) gives 

_ ha(Van/Wya)h2(Van) 

1.17 mu = eee 
on ime = FW. haa WR) | 


Hence 
[H° (Qp, Van): np] = #imô > > #(O/)*"ho(Vgn)/ho(W2n"). 
The inequality in (iii) follows for X = Vyn and the case X = W)n is similar. 
Case (ii) is similar. In case (iv) we just need #imu which is given by (1.17) 
with Wn replacing Vyn. In case (v) we have already observed in Section 1 that 
Raynaud’s results imply that # H (Qp, Va) = 1 in the flat case. Moreover 
# H1 (Qp, Van) can be computed to be #(O/A)?" from | 
H? (Qp, Van) ~ He (Qp, V)an ~ Homo (pr/pp, K/O) an 


where R is the universal local flat deformation ring of po for O-algebras. Using 
the relation R ~ RÊ a where Rf is the corresponding ring for W(k)- 
W(k 


algebras, and the main theorem of [Ram] (Theorem 4.2) which computes RÊ, 
we can deduce the result. 

We now prove (vi). From the definitions 
(#0/A")" #H°(Qp, Wyn) if pfalD, does not split 
(#0/A")’ if pg,alp, splits 
where r = dimx HŁ (Qp, Y). This we can compute using the calculations in 


[BK, Cor. 3.8.4]. We find that r = 2 in the non-split case and r = 3 in the 
split case and (vi) follows easily. a 


# H$ (Qp, Vyn) = l 


3. Some results on subgroups of GL2(k) 


We now give two group-theoretic results which will not be used until 
Chapter 3. Although these could be phrased in purely group-theoretic terms — 
it will be more convenient to continue to work in the setting of Section 1, i.e., 
with po as in (1.1) so that im pọ is a subgroup of GL2(k) and det pọ is assumed 

odd. | 


LEMMA 1.10. Jfimpo has order divisible by p then: 


(i) It contains an element Yo of order ; m > 3 with (m,p) = 1 and % trivial 
on any abelian quotient of im po. 

(ii) It contains an element polo) with any prescribed image in the Sylow. 
2-subgroup of (im po)/(im po)’ and with the ratio of the eigenvalues not equal 
to w(o). (Here (im po)’ denotes the derived subgroup of (im po).) 
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The same results hold if the image of the projective representation po as- 
sociated to po is isomorphic to A4, S4 or As. 


Proof. (i) Let G = impo and let Z denote the center of G. Then we 
have a surjection G’ — (G/Z)’ where the ’ denotes the derived group. By 
Dickson’s classification of the subgroups of GLo(k) containing an element of 
order p, (G/Z) is isomorphic to PGL2(k’) or PSL2(k’) for some finite field k’ of 
characteristic p or possibly to As when p = 3, cf. [Di, §260]. In each case we can 
find, and then lift to G’, an element of order m with (m, p) = 1 and m > 3, 
except possibly in the case p = 3 and PSLo(F3) ~ A4 or PGLo(F3) ~ S4. 
However in these cases (G/Z)' has order divisible by 4 so the 2-Sylow subgroup 
of G’ has order greater than 2. Since it has at most one element of exact order 
2 (the eigenvalues would both be —1 since it is in the kernel of the determinant 
and hence the element would be —/) it must also have an element of order 4. 

The argument in the A4, S4 and As cases is similar. 

(ii) Since pp is assumed absolutely irreducible, G = im po has no fixed line. 
We claim that the same then holds for the derived group G’. For otherwise 
since G” 4G we could obtain a second fixed line by taking (gv) where (v) is the 
original fixed line and g is a suitable element of G. Thus G’ would be contained 
in the group of diagonal matrices for a suitable basis and either it would be 
central in which case G would be abelian or its normalizer in GLo(k), and 
hence also G, would have order prime to p. Since neither of these possibilities 
is allowed, G” has no fixed line. | 

By Dickson’s classification of the subgroups of GLe(k) containing an el- 
ement of order p the image of im pọ in PGL2(k) is isomorphic to PGLo(k’) 
or PSL2(k’) for some finite field k’ of characteristic p or possibly to A5 when 
p = 3. The only one of these with a quotient group of order p is PSL2(F3) 
when p = 3. It follows that p + [G : G’] except in this one case which we treat 
separately. So assuming now that p { [G : G'] we see that G” contains a non- 
trivial unipotent element u. Since G” has no fixed line there must be another 
noncommuting unipotent element v in G’. Pick a basis for po|g consisting 
of their fixed vectors. Then let 7 be an element of Gal(Qs/Q) for which the 
image of po(T) in G/G’ is prescribed and let po(r) = (¢ ae Then 


5s. {2 b 1 sa 1 
led 1 r8 1 
has det(5) = det po(r) and trace ô = sa(raß +c) +br8 +a +d. Since p > 3 
we can choose this trace to avoid any two given values (by varying s) unless 
raß +c = 0 for all r. But raß + c cannot be zero for all r as otherwise 


a = c = 0. So we can find a 6 for which the ratio of the eigenvalues is not 
w(T), det(6) being, of course, fixed. 
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Now suppose that im po does not have order divisible by p but that the 
associated projective representation po has image isomorphic to S4 or As, so 
necessarily p Æ 3. Pick an element 7 such that the image of po(7) in G/G’ is 
any prescribed class. Since this fixes both det po9(7) and w(T) we have to show 
that we can avoid at most two particular values of the trace for r. To achieve 
this we can adapt our first choice of r by multiplying by any element of G’. So 
pick ø € G’ as in (i) which we can assume in these two cases has order 3. Pick 
a basis for po, by extending scalars if necessary, so that o +> (° Sci Then one 
checks easily that if po(r) = ( $) we cannot have the traces of all of 7, ør and 
a?r lying in a set of the form {+t} unless a = d = 0. However we can ensure 
that po(7) does not satisfy this by first multiplying 7 by a suitable element of 
G” since G’ is not contained in the diagonal matrices (it is not abelian). 

In the A, case, and in the PSL2(F3) ~ A4 case when p = 3, we use a 
different argument. In both cases we find that the 2-Sylow subgroup of G/G' 
is generated by an element z in the centre of G. Either a power of z is a suitable 
candidate for po(o) or else we must multiply the power of z by an element of 
G’, the ratio of whose eigenvalues is not equal to 1. Such an element exists 
because in G’ the only possible elements without this property are {FI} (such 
elements necessarily have determinant 1 and order prime to p) and we know 
that #G’ > 2 as was noted in the proof of part (i). o 


Remark. By a well-known result on the finite subgroups of PGL2 (Fp) this 
lemma covers all pọ whose images are absolutely irreducible and for which po 
is not dihedral. 


Let Kı be the splitting field of pọ. Then we can view W, and W¥ as 
Gal(Kı(p)/Q)-modules. We need to analyze their cohomology. Recall that 
we are assuming that po is absolutely irreducible. Let po be the associated 
projective representation to PGLo(k). | 


The following proposition is based on the computations in [CPS]. 
PROPOSITION 1.11. Suppose that po is absolutely irreducible. Then 


H! (Kı (&)/Q, Wx) = 0. 


Proof. If the image of pọ has order prime to p the lemma is trivial. The 
subgroups of GL2(k) containing an element of order p which are not contained 
in a Borel subgroup have been classified by Dickson [Di, §260] or [Hu, II.8.27] 
Their images inside PGL2(k’) where k’ is the quadratic extension of k are 
conjugate to PGL2(F) or PSL2(F) for some subfield F of k’, or they are 
isomorphic to one of the exceptional groups Ay, S4, As. 

Assume then that the cohomology group H1(K1(¢,)/Q, W*) # 0. Then 
by considering the inflation-restriction sequence with respect to the normal 


478 ANDREW WILES 


subgroup Gal(K1(G))/K1) we see that ¢, € K1. Next, since the representation 
is (absolutely) irreducible, the center Z of Gal(K,/Q) is contained in the 
diagonal matrices and so acts trivially on W). So by considering the inflation- 
restriction sequence with respect to Z we see that Z acts trivially on ¢, (and 
on W*). So Gal(Q(¢,)/Q) is a quotient of Gal(K1/Q)/Z. This rules out all 
cases when p Æ 3, and when p = 3 we only have to consider the case where the 
image of the projective representation is isomorphic as a group to PGL2 (F) 
for some finite field of characteristic 3. (Note that S4 ~ PGL2(F3).) 

Extending scalars commutes with formation of duals and H1, so we may 
assume without loss of generality F C k. If p = 3 and #F > 3 then 
H! (PSL(F), W\) = 0 by results of [CPS]. Then if pọ is the projective 
representation associated to pọ suppose that g`! im pọ g = PGL2(F) and let 
H = g PSL2(F)g7!. Then W, ~ W* over H and 


(1.18) H! (H, Wa) @F ~ H!(g`1Hg, g` (Wg F)) = 0. 


We deduce also that H'(impo, W¥) = 0. 

Finally we consider the case where F = F3. I am grateful to Taylor for the 
following argument. First we consider the action of PSL2(F3) on W, explicitly 
by considering the conjugation action on matrices {A € M2(F3) : trace A = 0}. 
One sees that no such matrix is fixed by all the elements of order 2, whence 


H! (PSL2(F3), Wy) ~ H1(Z/3, (W,)@*@) =0 


where C2 x C2 denotes the normal subgroup of order 4 in PSL2(F3) ~ Ag. Next 
we verify that there is a unique copy of A4 in PGL2(F3) up to conjugation. 
For suppose that A, B € GLo(F3) are such that A? = B? = I with the images 
of A, B representing distinct nontrivial commuting elements of PGL2(F3). We 
can choose A = ( 9) by a suitable choice of basis, i.e., by a suitable conju- 
gation. Then B is diagonal or antidiagonal as it commutes with A up to a 
scalar, and as B, A are distinct in PGL2(F3) we have B = (e =) for some 
a. By conjugating by a diagonal matrix (which does not change A) we can 
assume that a = 1. The group generated by {A, B} in PGL2(F3) is its own 
centralizer so it has index at most 6 in its normalizer N. Since N/(A, B) ~ S3 
there is a unique subgroup of N in which (A, B) has index 3 whence the image 
of the embedding of A4 in PGL2(F3) is indeed unique (up to conjugation). So 
arguing as in (1.18) by extending scalars we see that H1(im pọ, W*) = 0 when 
F = Fs also. oO 


The following lemma was pointed out to me by Taylor. It permits most 
dihedral cases to be covered by the methods of Chapter 3 and [TW]. 


LEMMA 1.12. Suppose that po is absolutely irreducible and that 


(a) Po is dihedral (the case where the image is Z/2 x Z/2 is allowed), 
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(b) po|, is absolutely irreducible where L = Q ( (—1)-1)/ p). 


= Then for any positive integer n and any irreducible Galois stable subspace X 
of Wx 8 k there exists an element o € Gal(Q/Q) such that 


(i) Solo) #1, 
(ii) o fixes Q(Gp"), 


(iii) o has an eigenvalue 1 on X. 


Proof. If fo is dihedral then pp Q k = Ind& x for some H of index 2 in G, 
where G = Gal(K1/Q). (As before, Kı is the splitting field of po.) Here H 
can be taken as the full inverse image of any of the normal subgroups of index 
2 defining the dihedral group. Then W, ® k ~ 6 @ Ind&(x/x’) where 6 is the 
quadratic character G — G/H and y’ is the conjugate of x by any element of 
G — H. Note that x # x’ since H has nontrivial image in PGLo(k). 

To find a ø such that (o) = 1 and conditions (i) and (ii) hold, observe 
that M(¢p.) is abelian where M is the quadratic field associated to 6. So 
conditions (i) and (ii) can be satisfied if fo is non-abelian. If fo is abelian (i.e., 
the image has the form Z/2 x Z/2), then we use hypothesis (b). If Ind%(x/x’) 
is reducible over k then W, Q k is a sum of three distinct quadratic characters, 
none of which is the quadratic character associated to L, and we can repeat 
the argument by changing the choice of H for the other. two characters. If 
X = Ind$(x/ x’) Q k is absolutely irreducible then pick any ø € G — H. This 
_ satisfies (i) and can be made to satisfy (ii) if (b) holds. Finally, since o € G—H 
we see that ø has trace zero and o* = 1 in its action on X. Thus it has an 
eigenvalue equal to 1. oO 


Chapter 2 


In this chapter we study the Hecke rings. In the first section we recall 
some of the well-known properties of these rings and especially the Goren- 
stein property whose proof is rather technical, depending on a characteristic _ 
p version of the g-expansion principle. In the second section we compute the 
relations between the Hecke rings as the level is augmented. The purpose is to 
find the change in the 7-invariant as the level increases. 

In the third section we state the conjecture relating the deformation rings 
of Chapter 1 and the Hecke rings. Finally we end with the critical step of 
showing that if the conjecture is true at a minimal level then it is true at — 
all levels. By the results of the appendix the conjecture is equivalent to the 
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equality of the 7-invariant for the Hecke rings and the p /p?-invariant for the 
deformation rings. In Chapter 2, Section 2, we compute the change in the 
n-invariant and in Chapter 1, Section 1, we estimated the change in the p/p?- 
invariant. 


1. The Gorenstein property 


For any positive integer N let X(N) = X1(N) /qQ be the modular curve 
over Q corresponding to the group [(N) and let J,(N) be its Jacobian. Let 
T (NV) be the ring of endomorphisms of Jı (N) which is generated over Z by the 
standard Hecke operators {T; = Tj, for l { N, Ug = Ugs for q | N, (a) = (a), 
for (a,N) = 1}. For precise definitions of these see [MW1, Ch. 2, §5]. In 
particular if one identifies the cotangent space of Jj(N)(C) with the space of 
cusp forms of weight 2 on I’; (V) then the action induced by T(N) is the usual 
one on cusp forms. We let A = {(a) : (a, N) = 1}. 

The group (Z/NZ)* acts naturally on X,(N) via A and for any sub- 
group H C (Z/NZ)* we let Xy(N) = X(N) qQ be the quotient X,(N)/H. 
Thus for H = (Z/NZ)* we have X(N) = Xo(N) corresponding to the group 
To(N). In Section 2 it will sometimes be convenient to assume that H decom- 
poses as a product H = || H in (Z/NZ)* ~ J](Z/q"Z)* where the product 
is over the distinct prime powers dividing N. We let Jz(N) denote the Ja- 
cobian of X7(N) and note that the above Hecke operators act naturally on 
Jy(N) also. The ring generated by these Hecke operators is denoted Ty(N) 
and sometimes, if H and N are clear from the context, we abbreviate this 
to T. 

Let p be a prime > 3. Let m be a maximal ideal of T = Ty(N) with 
p Em. Then associated to m there is a continuous odd semisimple Galois 
representation pm, 


(2.1) Pm: Gal(Q/Q) > GL2(T/m) 
unramified outside N p which satisfies 
trace pm(Frobq) = Ty, det pm(Frobq) = (q)q 


for each prime q { Np. Here Frobg denotes a Frobenius at q in Gal(Q/Q). 
The representation pm is unique up to isomorphism. If p { N (resp. p | N) we 
say that m is ordinary if T, ¢ m (resp. Up ¢ m). This implies (cf., for example, 
theorem 2 of [Wil]) that for our fixed decomposition group D, at p, 


ef AE F 
Dp 0 x2 


for a suitable choice of basis, with x2 unramified and x2(Frob p) = Tp mod 
m (resp. equal to Up). In particular pm is ordinary in the sense of Chapter 1 


Pm 
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provided x; Æ x2. We will say that m is D,-distinguished if m is ordinary and 
xı Æ x2. (In practice xı is usually ramified so this imposes no extra condition. ) 
We caution the reader that if pm is ordinary in the sense of Chapter 1 then we 
can only conclude that m is D,-distinguished if p { N. 

Let T,, denote the completion of T at m so that T,, is a direct factor of the 
complete semi-local ring T, = T & Zp. Let D be the points of the associated 
m-divisible group 


D = Ju(N) (Q)m =~ Ju(N) (Q)p @ Tm 


Pp 


It is known that D = Homz,,(D, Qp/Zp) is a rank 2 T,,-module, i.e., that 
D a Q= Ae A Qp)?. Briefly it is enough to show that H'(Xy(N),C) is 


hae of rank 2 eek T & C and this reduces to showing that S2 (T(N), ©), 
the space of cusp forms of weight 2 on T(N), is free of rank 1 over T & C. 
One shows then that if {f;,..., fr} is a complete set of normalized newforms | 
in So(['y(N), C) of levels m1,...,m, then if we set di = N/m;, the form 
f =X fi(diz) is a basis vector of So([y(N), C) as a T & C-module. 

If m is ordinary then Theorem 2 of [Wil], itself a straightforward gener- 
alization of Proposition 2 and (11) of [MW2], shows that (for our fixed de- 
composition group D,) there is a filtration of D by Pontrjagin duals of rank 1 
T,n-modules (in the sense explained above) _ 


(2.2) 0> D> D> D* = 0 


where D? is stable under D, and the induced action on D* is unramified with 
Frob p = U, on it if p | N and Frobp equal to the unit root of z? — Tpx + p(p) 
= 0 in Tm if p { N. We can describe D° and DË as follows. Pick a ø € 
Ip which induces a generator of Gal (Qp(Cwp~)/Qp(Gnp))- Let e: Dp > Z 
be the cyclotomic character. Then D? = ker (o — é(o))“", the kernel being 
taken inside D and ‘div’ meaning the maximal divisible subgroup. Although 
in [Wil] this filtration is given only for a factor Aş of Jı(N) it is easy to 
deduce the result for Jy(N) itself. We note that this filtration is defined 
without reference to characteristic p and also that if m is D,-distinguished, p? 
(resp. DË) can be described as the maximal submodule on which o — ¥i(¢) 
is topologically nilpotent for all ø € Gal(Q, /Qp) (resp. quotient on which 
ø — X2(c) is topologically nilpotent for all ø € Gal(Q,/Qp)), where X:(c) is 
any lifting of x;(0) to Tm. 

The Weil pairing ( , ) on Jy(N)(Q) m satisfies the relation (t.r,y) = 
(x,t*y) for any Hecke operator t. It is more convenient to use an adapted 
pairing defined as follows. Let we, for Ç a primitive N th root of 1, be the 
involution of X(N) /qQcc) defined in [MW1, p. 235]. This induces an involution 
of XH(N) Qc) also. Then we can define a new pairing [ , ] by setting (for a 
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(2.3) 0 ix, y] = (ar, wey). 


Then [t,2, y] = [x,t.y] for all Hecke operators t. In particular we obtain an 
induced pairing on Dm. | 

The following theorem is the crucial result of this section. It was first 
proved by Mazur in the case of prime level [Ma2]. It has since been generalized 
in [Til], [Ril] [M Ri], [Gro] and [E1], but the fundamental argument remains 
that of [Ma2]. For a summary see [E1, §9]. However some of the cases we need 
are not covered in these accounts and we will present these here. 


THEOREM 2.1. (i) Ifp { N and pm is irreducible then 
Jn(N)(Q)[m] ~ (T/m)?. 

(ii) Ifp { N and pm is irreducible and m is Dy-distinguished then — 
Ju(Np)(Q)[m] = (T/m). 


(In case (ii) m is a mazimal ideal of T = Ty(Np).) 


COROLLARY 1. In case (i), Ju(N)(Q),. ~ TÊ and Tam (Ja(N) (Q)) ~ 


2 
T2. 


In case (ii), Jau(Np)\(Q)„ ~ T2 and Tam (Ju(Np) (Q)) ~ T? (where 
Ta = Ta(Np)m) 


COROLLARY 2. In either of cases (i) or (ii) Tm is a Gorenstein ring. 


In each case the first isomorphisms of Corollary 1 follow from the theorem 
together with the rank 2 result alluded to previously. Corollary 2 and the 
second isomorphisms of corollary 1 then follow on applying duality (2.4). (In 
the proof and in all applications we will only use the notion of a Gorenstein 
Zp-algebra as defined in the appendix. For finite flat local Z,-algebras the 
notions of Gorenstein ring and Gorenstein Zp-algebra are the same.) Here 


Tam (J H(N ) (Q)) = Tog (J H(N) (Q)) g Tm is the m-adic Tate module of 
p 
Jg(N). 
We should also point out that although Corollary 1 gives a representation 
from the m-adic Tate module 


P = pr,,: Gal(Q/Q) > GL2 (Tm) 


_ this can be constructed in a much more elementary way. (See [Ca3] for another 
argument.) For, the representation exists with Tm Q Q replacing Tm when we 
use the fact that Hom(Q,/Zp, D) ®Q was free of rank 2. A standard argument 
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using the Eichler-Shimura relations implies that this representation p' with 
values in GL2(Tm Q Q) has the property that 


trace p'(Frob £) = Te, det p'(Frob £) = £(é) | 


for all 2 + Np. We can normalize this representation Ri picking a complex 


conjugation c and choosing a basis such that p'(c) = Ga D and then by picking 


a T for ge p(r) = (€ a with b,c, # 0(m) and by rescaling the basis so 
that b, = 1. (Note that the explicit Cese iphon; of the traces shows that if pm 
is also peeled so that pm(c) = 3) then b,c, mod m = br,mCr,m Where 


put) = eel The existence of a 7 such that brc, # O(m) comes from 
the irreducibility of Pm.) With this normalization one checks that p’ actually 
takes values in the (closed) subring of Tm generated over Zp by the traces. 
One can even construct the representation directly from the representations in 
Theorem 0.1 using this ring which is reduced. This is the method of Carayol 
which requires also the characterization of p by the traces and determinants 
(Theorem 1 of [Ca3]). One can also often interpret the Uq operators in terms 
of p for q | N using the m4 ~ 7(oq) theorem of Langlands (cf. [Ca1]) and the 


U, operator in case (ii) using Theorem 2.1.4 of [Wil]. 


Proof (of theorem). The important technique for proving such multiplicity- 
one results is due to Mazur and is based on the g-expansion principle in char- 
acteristic p. Since the kernel of J#(N)(Q) — Jı(N)(Q) is an abelian group on 
which Gal(Q/Q) acts through an abelian extension of Q, the intersection with 
kerm is trivial when pm is irreducible. So it is enough to verify the theorem 
for Jı(N) in part (i) (resp. Ji(Np) in part (ii)). The method for part (i) was 
developed by Mazur in [Ma2, Ch. II, Prop. 14.2]. It was extended to the case 
of T'o(N) in [Ril, Th. 5.2] which summarizes Mazur’s argument. The case of 
Tı(N) is similar (cf. [E1, Th. 9.2]). 

Now consider case (ii). Let Ap) = = {(a):a=1(N)} CA. Let us first 
assume that A(,) is nontrivial mod m, i.e., that 6—1 ¢ m for some 6 €. Aw): This 
case is essentially covered in [Til] (and ne in [Gro]). We pny review the 
argument for use later. Let K= Qp(Cp), Gp being a primitive p‘® root of unity, 
and let © be the ring of integers of the completion of the maximal unramified 
extension of K. Using the fact that Ap) is nontrivial mod m together with 
Proposition 4, p. 269 of [MW1] we find that 

Np) o (Fp) ~ (Pic? E$ x Pic? =’) m (Fp) 
where the notation is taken from [MW1] loc. cit. Here ft and Xf are the 
two smooth irreducible components of the special fibre of the canonical model 
of Xi(Np)/o described in [MW1, Ch. 2]. (The smoothness in this case was 
proved in [DR].) Also Ji(Np)* jo denotes the canonical étale quotient of the 
m-divisible group over O. This makes sense because Jı(Np)m does extend to 
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a p-divisible group over O (again by a theorem of Deligne and Rapoport [DR] 
and because Ap) is nontrivial mod m). It is ordinary as follows from (2.2) when 
we use the main theorem of Tate ([Ta]) since D? and DË clearly correspond 
to ordinary p-divisible groups. 

Now the g-expansion principle implies that dimp, X[m] < 1 where 


X = {H° (£$, 0) @ H (ZT, 2)} 


and m’ is defined by embedding T/m — F, and setting m’ = ker: T 8 F, — Fp 
under the map t &a | atmodm. Also T acts on Pic? Hf x Pic? Df, the 
abelian variety part of the closed fibre of the Neron model of Jı(Np);o, and 
hence also on its cotangent space X. (For a proof that X[m] is at most one- 
dimensional, which is readily adapted to this case, see Lemma 2.2 below. For 
similar versions in slightly simpler contexts see [Wi3, §6] or [Gro, §12].) Then 
the Cartier map induces an injection (cf. Prop. 6.5 of [Wi3]) 


6:{Pic® bt x Pic? D$} [p] (Fp) g F, > X. 
l p 


The composite 6 o we can be checked to be Hecke invariant (cf. Prop. 6.5 of 
[Wi3]. In checking the compatibility for Up use the formulas of Theorem 5.3 
of [Wi3] but note the correction in [MW1, p. 188].) It follows that 


Ji(Np)mjo (Fp) {m] ~ T/m 
as a T-module. This shows that if H is the Pontrjagin dual of 
H = J\(Np)m/o(Fp) then H ~ Thm since H/m ~ T/m. Thus 
Ji(NP)mjo(Fp) [p] + Hom(Tm/p, Z/pZ). 


Now our assumption that m is Dp-distinguished enables us to identify 


D? = (Np) yoQ) > D? = ANP) oQ). 


For the groups on the right are unramified and those on the left are dual to 
groups where inertia acts via a character of finite order (duality with respect 


to Hom( ,Qp/Zp(1))). So 
D’ ip] > T n/p, D”[p] > Hom (Tm/p, Z/pZ) | 


as T„-modules, the former following from the latter when we use duality under 
the pairing | ,]. In particular as m is Dp-distinguished, 


(2.4) © Dip =Tu/p © Hom(T n/p, Z/pZ). 


We now use an argument of Tilouine [Til]. We pick a complex conjugation 
T. This has distinct eigenvalues +1 on pm so we may decompose D|[p] into 
eigenspaces for T: | 


Dip] = Dipl" © Dipl. 
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Since Tm/p and Hom (Tm/p, Z/pZ) are both indecomposable Hecke-modules, 
by the Krull-Schmidt theorem this decomposition has factors which are iso- 
morphic to those in (2.4) up to order. So in the decomposition 


Dim] = Dim] * @ D[m] 


one of the eigenspaces is T to T/m and the other to (Tm/p)[m]. But 
since pm is irreducible it is easy to see by considering D[m| 6 Hom(D|m], det pm) 
that r has the same number of eigenvalues equal to +1 as equal to —1 in D[m], 
whence #(T,,/p) {m] = #(T/m). This shows that D[m|t > D[m]~ ~ T/m as | 
required. | 

Now we consider the case where A(,) is trivial mod m. This case was 
treated (but only for the group To(Np) and pm ‘new’ at p—the crucial re- 
striction being the last one) in [M Ri]. Let X1(N,p)/q be the modular curve 
corresponding to Iy(N) NTo(p) and let Jı(N, p) be its Jacobian. Then since 
the composite of natural maps Jı(N, p) > Ji(Np) > Ji(N, p) is multiplication 
by an integer prime to p and since Aç(p) is trivial mod m we see that 


Jy(N, p)m(Q) = Ji(Np)m(Q). 


It will be enough then to use Ji(N, p), and the corresponding ring T and ideal 
m. 


The curve Xı(N,p) has a canonical model Xi(N,p)/, which over F, 
p 


consists of two smooth curves XÍ and b# intersecting transversally at the 
supersingular points (again this is a theorem of Deligne and Rapoport; cf. 
[DR, Ch. 6, Th. 6.9], [KM] or [MW1] for more details). We will use the models 
described in [MW1, Ch. II] and in particular the cusp oo will lie on X#. Let 
Q denote the sheaf of regular differentials on Xi(N,p)/p, (cf. [DR, Ch. 1 §2], 
[M Ri, §7]). Over F,, since X1(N, p) F, has ordinary double point singularities, 
the differentials may | be identified with the meromorphic differentials on the 
normalization X(N, (N,p) IF, = = yt U DY which have at most simple poles at the 
supersingular points (the intersection points of the two components) and satisfy 
res,, +res;, = 0 if xı and 2x2 are the two points above such a supersingular 
point. We need the following lemma: 


LEMMA 2.2. diMT /m H®(X1(N P) F 9) [m] = =h 


Proof. First we remark that the action of the Hecke operator Up here is 
most conveniently defined using an extension from characteristic zero. This is 
explained below. We will first show that dimT/m H° (X1 (N, p)/F,, Q) [m] < 1, 
this being the essential step. If we embed T/m—F, and then set 


m = ker: T & F, — F, (the map given by t & a +> atmodm) then it is 
enough to show that dimp, H®(X,(N, p) /F,” Q)[m] < 1. First we will suppose 
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that there is no nonzero holomorphic differential in H°(X1(N,p) iF,” Q) [m], 


i.e., no differential form which pulls back to holomorphic differentials on = 
and ©“. Then if wı and we are two differentials in H°(X1(N, p) iF Q) [m], 
the q-expansion principle shows that ww, — Aw has zero q-expansion at oo for 
some pair (u, A) # (0,0) in F, and thus is zero on D4. As pw, — Aw2 = 0 on 
EH it is holomorphic on XÍ. ‘By our hypothesis it would then be zero which 
shows that wı and wə are linearly dependent. 

This use of the g-expansion principle in characteristic p is wadal and due 
to Mazur [Ma2]. The point is simply that all the coefficients in the q-expansion 
are determined by elementary formulae from the coefficient of q provided that 

w is an eigenform for all the Hecke operators. The formulae for the action of 
these operators in characteristic p follow from the formulae in characteristic 
zero. . To see this formally (especially for the Up operator) one checks first 
that H°(X1(N, p) /Zp» Q), where Q denotes the sheaf of regular differentials on 
'Xı(N, p) /Zp, behaves well under the base changes Zp — F, and Z, > Q,; 
cf. [Ma2, §II.3] or [Wi3, Prop. 6.1]. The action of the Hecke operators on 
- Jı(N, p) induces an action on the connected component of the Neron model of 
Ji(N, p) /Q, 50 also on its tangent space and cotangent space. By Grothendieck 
duality the cotangent space is isomorphic to H?(Xı(N, p) /Zyr Q); see (2.5) 
below. (For a summary of the duality statements used in this context, see 
[Ma2, §II.3]. For explicit duality over fields see [AK, Ch. VIII].) This then 
defines an action of the Hecke operators on this group. To check that over Q, 
this gives the standard action one uses the commutativity of the diagram after 
Proposition 2.2 in [Mil]. 

Now assume that there is a nonzero holenershic differential in 


H? (X(N, p) F, Q) [m]. 


We claim that the space of holomorphic differentials then has dimension 1 and 
that any such differential w Æ 0 is actually nonzero on X”. The dimension 
- claim follows from the second assertion by using the q-expansion principle. To 
prove that w Æ 0 on XF” we use the formula 


Up (z, y) = (Fa, y') 


for (x,y) € (Pic? Ue x Pic? DH)( F,), where F denotes the Frobenius endo- 
morphism. The value of y’ will not be needed. This formula is a variant 
on the second part of Theorem 5.3 of [Wi3] where the corresponding re- 
sult is proved for Xı(Np). (A correction to the first part of Theorem 5.3 
was noted in [MW1, p. 188].) One checks then that the action of Up on 
Xo = H} (X, 9!) 6 HO (£$, Q!) viewed as a subspace of H? (X(N, p) F,» Q) 
is the same as the action on Xo viewed as the cotangent space of Pic? E” x 
Pic? D*. From this we see that if w = 0 on D“ then Upw = 0 on ©*. But Up 
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acts as a nonzero scalar which gives a contradiction if w Æ 0. We can thus as- 
sume that the space of m’-torsion holomorphic differentials has dimension 1 and 
is generated by w. So if w2 is now any differential in H°(X1(N, p) JF,’ Q) [m] 
then w2 — àw has zero q-expansion at oo for some choice of A. Then w— àw = 0 
on X” whence w — Aw is holomorphic and so w = Aw. We have now shown 
in general that dim( H? (X(N, P) F,’ Q) [m’]) <1. 

The singularities of X(N, p) /Z, at the supersingular points are formally 
isomorphic over Zut to Zumt [[X,Y]] /(XY — p*) with k = 1, 2 or 3 (cf. [DR, 
Ch. 6, Th. 6.9]). If we consider a minimal regular resolution Mı (N, p)/z, 
then H°(M:ı(N, p)/F, 9) ~ H? (X(N, p) RF, 9) (see the argument in [Ma2, 
Prop. 3.4]), and a similar isomorphism holds for H? (M(N, p) IZp Q). 

As Mı(N,p)/z, is regular, a theorem of Raynaud [Ray2] says that the 
connected component of the Neron model of Ji(N,p)/q, is Ji(N, P) 12, ~ 


Pic?(M,(N, p) /Z,)- Taking tangent spaces at the origin, we obtain 


(2.5) Tan(Ji(N,p)]z,) = H'(M1(N, p) /Zp Om (wp): 


Reducing both sides mod p and applying Grothendieck duality we get an iso- 
morphism 


(2.6) Tan( J (N, P) ir, ) ~, Hom(H°(X1(N, P)/F,, 9), Fp). 
(To justify the reduction in detail see the arguments in [Ma2, §II. 3]). Since 
Tan(J,(N, P)72,,) is a faithful T @ Zp-module it follows that 
H? (X1 (N, p) jF, 9) [m] 
is nonzero. This completes the proof of the lemma. o 
To complete the proof of the theorem we choose an abelian subvariety 
A of Jı(N,p) with multiplicative reduction at p. Specifically let A be the 


connected part of the kernel of Jı(N, p) — Jı(N) x Ji(N) under the natural 
map Ê described in Section 2 (see (2.10)). Then we have an exact sequence 


0-A-J(N,p) - B-0 


and Jı(N,p) has semistable reduction over Q, and B has good reduction. 
By Proposition 1.3 of [Ma3] the corresponding sequence of connected group 
schemes | 


0—- Alpliz, > Ji(N,p)[pl7z,, > Bleliz, — 0 


is also exact, and by Corollary 1.1 of the same proposition the corresponding 
sequence of tangent spaces of Neron models is exact. Using this we may check 
that the natural map 


(2.7) © Tan(Ji(N,p) ple.) g Tm —> Tan( J (N, p) F) g Tin 
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is an isomorphism, where t denotes the maximal multiplicative-type subgroup 
scheme (cf. [Ma3, §1]). For it is enough to check such a relation on A and B 
separately and on B it is true because the m-divisible group is ordinary. This 
follows from (2.2) by the theorem of Tate [Ta] as before. 

Now (2.6) together with the lemma shows that 


Tan(Ji(N,p))/z, g Tos ee 
p 


We claim that (2.7) together with this implies that as Tm-modules 


V := J(N, p) [p] (Qp)m = (Tm/p). 
To see this it is sufficient to exhibit an isomorphism of Fp-vector spaces 


(2.8) Tan(G F,) ~ G(Q,) g Fr 


for any multiplicative-type group scheme (finite and flat) G/z, which is killed 
by p and moreover to give such an isomorphism that respects the action of 
endomorphisms of G'z,. ‘To obtain such an isomorphism observe that we have 
isomorphisms 


(2.9) Homa (Hp; G) on ~ Homm (Hp, G) g Fr 


R 


Hom (Tam (p/p ), Tan(G jp, ) 
p 


where Homg denotes homomorphisms of the group schemes viewed over Q, 
p 
and similarly for Homg . The second isomorphism can be checked by reducing 
p : 


to the case G = p. Now picking a primitive p? root of unity we can iden- 
tify the left-hand term in (2.9) with G (Q,) g F,. Picking an isomorphism of 
p 


Tan(1, JF) with F, we can identify the last term in (2.9) with Tan(G F) 
Thus after these choices are made we have an isomorphism in (2.8) which 
respects the action of endomorphisms of G. 

On the other hand the action of Gal(Q,/Qp) on V is ramified on every 
subquotient, so V C D®[p]. (Note that our assumption that A(p) is trivial 
mod m implies that the action on D°[p] is ramified on every subquotient and 
on DË[p] is unramified on every subquotient.) By again examining A and B 
separately we see that in fact V = D°[p]. For A we note that Al[p|/Alp]’ is 
unramified because it is dual to A[p]* where A is the dual abelian variety. We 
can now proceed as we did in the case where A(p) was nontrivial modm. O 
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2. Congruences between Hecke rings 


Suppose that q is a prime not dividing N. Let rı(N,q) = T1(N) NTo(q) 
and let X1(N,q) = Xi(N,q)/q be the corresponding curve. The two natural 
maps X1(N,q) — X(N) induced by the maps z — z and z — qz on the 
upper half plane permit us to define a map Ji(N) x A(N) > Ji(N,q). Using 
a theorem of Ihara, Ribet shows that this map is injective (cf. [Ri2, Cor. 4.2]). 
Thus we can define y by 


(2.10) 04 A(N) x A(N)—— J (N, Q). 
Dualizing, we define B by 
Yp Ò . 
0+ B——.y(N, gq) ——-a (N) x iN) > 0. 


Let T1(N,q) be the ring of endomorphisms of Ji(N,q) generated by the 
standard Hecke operators {7}, for l { Nq, Uj, for l | Nq, (a) = (a), for 
(a, Nq) = 1}. One can check that Ug preserves B either by an explicit calcu- 
lation or by noting that B is the maximal abelian subvariety of Jı(N,q) with 
multiplicative reduction at q. We set Jo = A(N) x A(N). z 

More generally, one can consider Jy(N) and Jy(N,q) in place of J;(N) 
and Ji(N,q) (where Jz(N, q) corresponds to X(N, q)/H) and we write Ty(N) 
and Ty(N;q) for the associated Hecke rings. In this case the corresponding 
map y may have a kernel. However since the kernel of Jy(N) > Ji(N) does 
not meet kerm for any maximal ideal m whose associated pm is irreducible, 
the above sequences remain exact if we restrict to m‘)-divisible groups, m‘% 
being the maximal ideal associated to m of the ring TN (N,q) generated by 
the standard Hecke operators but omitting Ug. With 4 this minor modifica- 
tion the proofs of the results below for H Æ 1 follow from the cases of full 
level. We will use the same notation in the general case. Thus y is the map 
Jz = Jy(N)*? — Jy(N,q) induced by z — z and z — qz on the two factors, 
and B = ker ĝ. (B will not be an abelian variety in general.) 

The following lemma is a straightforward generalization of a lemma of 
Ribet ({Ri2]). Let ng be an integer satisfying ng = aN ) and ng = 1(q), and 
write (q) = (nq) E€ TH(Nq). 


LEMMA 2.3 (Ribet). ¥(B)N p(J2) ma = (J2) [U2 — (a) mia for irre- 
ducible pm. | 


Proof. The left-hand side is (imọ N ker), so we compute y~! (imọ N 


ker 9) = ker(ĝ o y). 
An explicit calculation shows that 
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where T* = T, - (q)~*. The matrix action here is on the left. We also find that 
on J . 


(2.11) r00=90| 9 T, 
whence 
(U2-(g))oy = oo| T, ARGI = 


Now suppose that m is a maximal ideal of T(N), p E m and pm is 
irreducible. We will now give a slightly stronger result than that given in the 
lemma in the special case q = p. (The case q # p we will also strengthen but 
we will do this separately.) Assume then that p { N and Tp ¢ m. Let ap be 
the unit root of z? — Tpx + plp) = 0 in TH(N)m. We first define a maximal 
ideal mp of Ty (N, p) with the same associated representation as m. To do this 
consider the ring 


Sı = Tg(N)[U1]/(U? — TpUi + p(p)) E End(Jg(N)’) 


where U; is the endomorphism of Jy(N)? given by the matrix 


It is thus compatible with the action of Up on Jy(N,p) when compared using 
ĝ. Now mı = (m, U1 — Gp) is a maximal ideal of Sı where ap is any element 
of T(N) representing the class Gp E€ Ty(N)m/m ~ Ty(N)/m. Moreover 
Sim, & TH(N)m and we let mp be the inverse image of m; in T g(N,p) under 
the natural map Tg(N,p) — Sı. One checks that mp is Dp EENE For 
any standard Hecke operator t except Up (i.e., t = T}, Uy for q’ . p or (a)) the 
image of t is t. The image of Up is U1. 
We need to check that the induced map 


a: TH(N,p)m — Sim ~ TH(N)m 


is surjective. The only problem is to show that Tọ is in the image. In the present 
context one can prove this using the surjectivity of ¢ in (2.12) and using the 
fact that the Tate-modules in the range and domain of ¢ are free of rank 2 by 
Corollary 1 to Theorem 2.1. The result then follows from Nakayama’s lemma as 
one deduces easily that Ty (N)m is a cyclic T(N, p)m,-module. This argument 
was suggested by Diamond. A second argument using representations can be 
found at the end of Proposition 2.15. We will now give a third and more direct 
proof due to Ribet (cf. [Ri4, Prop. 2]) but found independently and shown to 
us by Diamond. 
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= For the following lemma we let T™ , for an integer M, denote the subring of 
End (S2(T1(N ))) generated by the Hecke operators T,, for positive integers n 


relatively prime to M. Here Sə (T(N )) denotes the vector space of weight 2 
cusp forms on ['|(N). Write T for T}. It will be enough to show that T, is 
a redundant operator in T1, i.e., that TP = T. The result for Ty(N),_ then 
follows. 


LEMMA (Ribet). Suppose that (M,N) =1. If M is odd then T’ =T. 
If M is even then T™ has finite index in T equal to a power of 2. 


_ As the rings are finitely generated free Z-modules, it suffices to prove that 
T™ & F; — T Q F; is surjective unless l and M are both even. The claim 
follows from 


1. TM & F; — T™/? @ F; is surjective if p | M and p + IN. 


2. T! 8 F; > T 9 F; is surjective if l + 2N. 


Proof of 1. Let A denote the Tate module Ta (Jı (N )). Then R = TMP Q 
Z; acts faithfully on A. Let R’ = (R g Qi) N Endz, A and choose d so that 
i 


IŻR' C IR. Consider the Gal(Q/Q)-module B = J (N)[IÌ] x uyu. By 
Cebotarev density, there is a prime q not dividing M NI so that Frob p = Frob q 
on B. Using the fact that T, = Frobr + (rì}r(Frobr)™! on A for r = p and 
r = q, we see that Tp = T, on Jı(N)[lf]. It follows that Tp — T; is in 14 Endz, A 
and therefore in [7R’ C IR. m 


Proof of 2. Let S be the set of cusp forms in S2(T1(N)) whose q-expansions 
at oo have coefficients in Z. Recall that S2(T1(N)) = SQC and that S is stable 
under the action of T (cf. [Sh1, Ch. 3] and [Hi4, §4]). The pairing T8 S > Z 
defined by T & f + aı(Tf) is easily checked to induce an isomorphism of 
T-modules | | 

S = Homz (T, Z). 


The surjectivity of T’/IT’ — T/IT is equivalent to the injectivity of the dual 
map l | | 
Hom(T, F;) ~ Hom(T’, F)). 


Now use the isomorphism S$/1S = Hom(T, F;) and note that if f is in the 
kernel of S — Hom(T", F;), then an(f) = a1(Tnf) is divisible by l for all n 
prime to l. But then the mod l form defined by f is in the kernel of the operator 
qï; and is therefore trivial if l is odd. (See Corollary 5 of the main theorem 
of iKal.) Therefore f is in LS. | 


Remark. The argument does not prove that T”? = T° if (d, N) £1. 
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We now return to the assumptions that pm is irreducible, p { N and 
T, ¢ m. Next we define a principal ideal (Ap) of Ty(N)m as follows. Since 
TH(N,p)m, and T7(N)m are both Gorenstein rings (by Corollary 2 of Theo- 
rem 2.1) we can define an adjoint â to 


Q: T(N, DP) mp ea Siwi ~ TH(N)m 


in the manner described in the appendix and we set A, = (ao @)(1). Then 
(A,) is independent of the choice of (Hecke-module) pairings on Ty(N, P) np 
and Ty(N)m. It is equal to the ideal generated by any composite map 


TH(N)m >? TH(N,P)mp <> TH(N)m 


provided that £ is an injective map of Ty (N, p)m,-modules with Zp torsion-free 
cokernel. (The module structure on Ty(N)m is defined via a.) 


PROPOSITION 2.4. Assume that m is Dy-distinguished and that pm is 
irreducible of level N with p { N. Then 


(Ap) = (T2 - (p)(1+P)”) = (@ — (P)). 
Proof. Consider the maps on p-adic Tate-modules induced by and ¢: 


Tap (Jx(N)”) 2+ Tap (Ju(N,p)) cs Tap (Ju(N)’). 


These maps commute with the standard Hecke operators with the exception 
of Ty or Up (which are not even defined on all the terms). We define 


S2 = Tu(N)[Ua] / (U2 - Tp U2 + p(p)) C End (Ju(N)”) 


where Uz is the endomorphism of Jy(N)? defined by C me, It satisfies 
pU = Upy. Again m2 = (m, U2 — dp) is a maximal ideal of S2 and we have, 
on restricting to the mj, mp and m2-adic Tate-modules: 


Tama (Ja (N)2) 2+ Tamy (Ja(N, p)) E ioa (JaN?) 
(2.12) T2 v2 f Nu 
Tam (Ju (N)) Tam (Ju (N)). 


The vertical isomorphisms are defined by v2: z — (—(p)@,apz) and vı: z > 
(apx, px). (Here ap € Ty(N)m can be viewed as an element of Ty(N)p ~ 
[I] TyH(N), where the product is taken over the maximal ideals containing 
p. So vı and vz can be viewed as maps to Tap (J H(N F) whose images are 
respectively Tam, (Ja(N F) and Tam (J H(N )2).) 

Now @ is surjective and ¢ is injective with torsion-free cokernel by the re- 
sult of Ribet mentioned before. Also Tam (J H(N )) ~ T(N)? and 
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Tamp (Ju(N, p)) ~ T(N, Dae by Corollary 1 to Theorem 2.1. So as y, © 
are maps of Ty(N, p)m,-modules we can use this diagram to compute Ap as 
remarked just prior to the statement of the proposition. (The compatibility 
of the Up actions requires that, on identifying the completions 51m, and S2 mo 
with T7(N)m, we get U1 = U2 which is indeed the case.) We find that 


vj oĝopovz (z) = az! (a2 — (p)) (2). a 


We now apply to J; (N, q*) (but q # p) the same analysis that we have just 
applied to J;(N,p). Here X1(A, B) is the curve corresponding to P;(A)NTo(B) 
and J;(A, B) its Jacobian. First we need the analogue of Ihara’s result. It is 
convenient to work in a slightly more general setting. Let us denote the maps 
X1(Nq"—!,q") — Xi(Nq""*) induced by z > z and z > qz by Tı, and 72,r 
respectively. Similarly we denote the maps Xi(N4q’, gq’ +!) — Xı (Nq) induced 
by z > z and z — qz by 73, and 74, respectively. Also let m : Xi(Nq") > 
X1(Nq"—',q") denote the natural map induced by z > z. 

In the following lemma if m is a maximal ideal of T;(Nq"~+) or Ti(Nq’) 
we use m\% to denote the maximal ideal of T(N q’,q’t') compatible with - 
m, the ring TW) (Nq’, g’t!) C T1(Nq",q"*) being the subring obtained by 
omitting U, from the list of generators. 


LEMMA 2.5. Ifq#>p is a prime andr > 1 then the sequence of abelian 
varieties 


£ £2 
0 > (Ng) —— ANg) x ANT) —— ANg, gt) 
where é = (m,r o 7)*,— (m2, 0 )*) and 2 = (når, n3) induces a corre- 
sponding sequence of p-divisible groups which becomes exact when localized at 
any m for which pm is irreducible. 


Proof. Let T!(N q") denote the group a ) Ef\(N):a=d= 1(q"), 
c=0(q"-1), b= o(a)}. Let Bı and Bt be given by | 


Bı =Ty(Nq")/Ta(Naq")NT(@),  B* =T(Nq") /Ta(Na") NT) 
and let Ag = T,(Nq"1) /T1(Naq") AT (q). Thus Ag ~ SL2(Z/q) if r = 1 and | 
is of order a power of q if r > 1. | | 

_ The exact sequences of inflation-restriction give: 
H! (T(N q"), Qp/Zp)— H> (T(N q") NT (gq), Qo/Zp) 


together with a similar isomorphism with à! replacing \; and B! replacing B1. 
We also obtain 


H1(T1(Nq"~1), Qp/Zp)——- (T1 (Na) NTa), Qp/Zp)**. 
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The vanishing of H*(SL2(Z/q), Q,/ Zp) can be checked by restricting to the 
Sylow p-subgroup which is cyclic. Note that im A,;Nim A! C H! (T(N q")nI(q), 
Q,/Zp)*4 since Bı and B! together generate A,. Now consider the sequence 
(2.13) 0 ——> H'(1i(Nq"~'), Qp/Zp) 


1 


H*(Ti(Nq’), Q,/Zp) © H*(I*(Nq’), Q,/Zp) 
i 
A, HUTANG) ATC), Qp/Zp). 
We claim it is exact. To check this, suppose that àı(x) = —A! (y). Then 
A(x) € H!(Ti(Na") N T(q), Qp/Zp)**. So àı(x) is the restriction of an 
ag! € H! (Ty (Nq"—*), Qp/Zp) whence x — res;(z’) € keràı = 0. It follows 
also that y = — res!(z’). 
Now conjugation by the matrix (3 D Die isomorphisms 


T(N) ~Ti(Nq’), Ty(Nq’) AT) > Ti(Nq r g+), 


So our sequence (2.13) yields the exact sequence of the lemma, except that we 
have to change from group cohomology to the cohomology of the associated 
complete curves. If the groups are torsion-free then the difference between 
these cohomologies is Eisenstein (more precisely T; — 1 — l for l = 1 mod N gr 
is nilpotent) so will vanish when we localize at the preimage of m{% in the 
abstract Hecke ring generated as a polynomial ring by all the standard Hecke 
operators excluding T}. If M < 3 then the group I';(M) has torsion. For 
M = 1,2,3 we can restrict to ['(3), ['(4), T(3), respectively, where the co- 
homology is Eisenstein as the corresponding curves have genus zero and the 
groups are torsion-free. Thus one only needs to check the action of the Hecke 
operators on the kernels of the restriction maps in these three exceptional cases. 
This can be done explicitly and again they are Eisenstein. This completes the 
proof of the lemma. | o 


res; @—res 


Let us denote the maps X;(N,q) — Xı(N) induced by z — z and z — qz 
by mı and 72 respectively. Similarly we denote the maps X1(N,q?) —> X1(N,q) 
induced by z — z and z — qz by 73 and 74 respectively. | 

From the lemma (with r = 1) and Ihara’s result (2.10) we deduce that 
there is a sequence | 


£ 
(2.14) 0 — A(N) x A(N) x A(N) —— JN, ¢@?) 
where € = (71 0 773)* X (T2 0 773)* X (T2 0 74)* and that the induced map of p- 
divisible groups becomes injective after localization at m(%’s which correspond 
to irreducible p,,’s. By duality we obtain a sequence 


é 
Ji(N,q*) —— J (N) — 0 


which is ‘surjective’ on Tate modules in the same sense. More generally we 
can prove analogous results for Jy(N) and Jy(N,q’) although there may be 
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a kernel of order divisible by p in Jy(N) — Jı(N). However this kernel will 
not meet the m(%-divisible group for any maximal ideal m(2) whose associated 
Pm is irreducible and hence, as in the earlier cases, will not affect the results if 
after passing to p-divisible groups we localize at such an m(4), We use the same 
notation in the general case when H # 1 so £ is the map Jy(N)? —> Jy(N, °). 
We suppose now that m is a maximal ideal of T(N) (as always with p € 
m) associated to an irreducible representation and that q is a prime, q { Np. 
We now define a maximal ideal m, of Ty(N, q*) with the same associated 
representation as m. To do this consider the ring | 


Sı = TH(N)[Vi]/Ui(U? — Ty U1 + a(q)) S End (Jn (N)’) 


where the action of U; on Jy(N)? is given by the matrix 


T, —(q) 0 
q 0 0 
0 q 0 


Then U; satisfies the compatibility 

E O Ug =U 1 O E 
One checks this using the actions on cotangent spaces. For we may identify 
the cotangent spaces with spaces of cusp forms and with this identification any 
Hecke operator tą induces the usual action on cusp forms. There is a maximal 
ideal mı = (U1, m) in Sı and Sim, ~ TH(N)m. We let mg denote the reciprocal 
image of mı in Ty(N, q?) under the natural map T(N, q?) > Sı. 

Next we define a principal ideal (Aj) of TH(N)m using the fact that 
Tu(N, q°)m, and TH(N)m are both Gorenstein rings (cf. Corollary 2 to The- 
orem 2.1). Thus we set (Aj) = (@’0a’) where 

a: T(N, q”) sng =z Siini = Ta(N)m 
is the natural map and @’ is the adjoint with respect to selected Hecke-module 
pairings on Ty(N, q?)m, and Ty(N)m. Note that a’ is surjective. To show 
that the T, operator is in the image one can use the existence of the associated 


2-dimensional representation (cf. §1) in which T, = trace(Frobq) and apply 
the Cebotarev density theorem. 


PROPOSITION 2.6. Suppose that m is a maximal ideal of T(N) associ- | 
ated to an irreducible pm. Suppose also that q { Np. Then | 


(At) = (4 — 1) (T? - (a)(1 +”). 


Proof. We prove this in the same manner as we proved Proposition 2.4. 
Consider the maps on p-adic Tate-modules induced by € and €: 


(2.15) Ta, (Ja(N)’) ATi (JH(N, 7) 2, Tap (Ja(N)’) . 
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These maps commute with the standard Hecke operators with the exception 
of T, and U, (which are not even defined on all the terms). We define 


S2 = Ty(N) [Ue] /U2(U3 — Ty U2 + 9(q)) © End (Ju(N)?) 


where Up is the endomorphism of Jy(N) given by the matrix 


0 0 0 
q 0 —(@q) 
0 q Ty 


Then U; € = € U2 as one can verify by checking the equality (E o€)U, =U, (E o£) 
` because E o Ẹ is an isogeny. The formula for E o € is given below. Again 
mz = (m, U2) is a maximal ideal of S2 and S2 m ~ Ty(N)m- On restricting 
(2.15) to the mg, mg and m,-adic Tate modules we get 


m~ 


Tama (JE(N)?) —— Tom (JE(N, q2)) —— Tam, (Ja(N)?) 
(2.16) f u f u, 
Tam (Ji(N)) Tan (Ja (N)). 


The vertical isomorphisms are induced by ug: z — ((q)z,—Tgz,qz) and 
uj: z — (0,0, z). Now a calculation shows that on Jy(N)? 


q(q+1) T,:q Tj? -(a)(1+q) 
(otn Tx -q q(q +1) Ty *4 
Ty? =O Qto Tg q(q+ 1) 


where Të = (q)~*Tq. 
We compute then that 


(uy o E o € ou) =—(q™!\(q—1) (7; — (q)(1+ q)’). 


Now using the surjectivity of € and that £ has torsion-free cokernel in (2.16) 
(by Lemma 2.5) and that Tam (Ja(N)) and Tam, (JH(N, @’)) are each free of 
rank 2 over the respective Hecke rings (Corollary 1 of Theorem 2.1), we deduce 
the result as in Proposition 2.4. oO 


There is one further (and completely elementary) generalization of this 
result. We let m: Xy(Nq,q*) — Xy(N,q?) be the map given by z > z. 
Then 7* : Jy(N,q*) —> Jy(Nq,q°) has kernel a cyclic group and as before 
this will vanish when we localize at m3) if m is associated to an irreducible 
representation. (As before the superscript q denotes the omission of U, from 
the list of generators of Ty (Nq, q?) and m‘ denotes the maximal ideal of 
TP (N q, q?) compatible with m.) 
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We thus have a sequence (not necessarily exact) 
0 — Jy(N)° pease Ju(Nq,q*7) > Z > 0 


where «x = ņ* o € which induces a corresponding sequence of p-divisible groups 
which becomes exact when localized at an m?) corresponding to an irreducible 
Pm- Here Z is the quotient abelian variety Jy(Nq, °) 4 im x. As before there 
is a natural surjective homomorphism 


a. Ty(Nq, a" sts eae: Simi = TH(N)m 


where mg is the inverse image of mı in Ty(Nq, °). (We note that one can 
replace Ty(Nq,q*) by Ty(Nq) in the definition of a and Proposition 2.7 
below would still hold unchanged.) Since both rings are again Gorenstein we 
can define an adjoint @ and a principal ideal : 


(A,) =(a © â). 


PROPOSITION 2.7. Suppose that m is a mazimal ideal of T = Ty(N) 
associated to an irreducible representation. Suppose that q { Np. Then 


(Aq) = (4 - 1)?(T? - (g)(1 +. 4)*)). 
The proof is a trivial generalization of that of Proposition 2.6. 


Remark 2.8. We have included the operator Uj in the definition of Tm, = 
Ty(Nq, a Vin, as in the application of the g-expansion principle it is important 
to have all the Hecke operators. However Ug = 0 in Tm,. To see this we recall 
that the absolute values of the eigenvalues c(q, f) of Uq on newforms of level 
Nq with q { N are known (cf. [Li]). They satisfy c(q, f)? = (q) in Of (the 
ring of integers generated by the Fourier coefficients of f) if f is on T',(N,q), 
and |c(q, f)| = q!/? if f is on T1ı(Nq) but not on Tj(N,q). Also when f is 
a newform of level dividing N the roots of x? — c(q, f)z +qx;(q) = 0 have 
absolute value q'/? where c(q, f) is the eigenvalue of T, and x(q) of (q). Since 
for f on T(Nq, 9°), Ugf is a form on T1(Nq) we see that , 

U, (UZ - (a)) J] Ua- cla, f)) TI (u; — c(q, f)Uq + ala) ) =0 
fES1 fESe2 
in T(Nq, q*) Q C where S; is the set of newforms on I'\(Nq) which are not 


on lı (N,q) and S2 is the set of newforms of level dividing N. In particular as 
Ug is in mą it must be zero in Ting: 


A slightly different situation arises if m is a maximal ideal of T = Ty(N,q) 
(q # p) which is not associated to any maximal ideal of level N (in the sense of 
having the same associated pm). In this case we may use the map £3 = (mä, 73) 
to give 


| és 7 é 
(2.17) Ju(N,q) x Ja(N, 4) —— Ju(N,q2) ——> Ju(N,q) x Ja (N,q). 
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Then és o €3 is given by the matrix 
q Ug 


E3 o €3 = 
U; q 


on Jy(N,q)?, where Uj = U,(q)—* and U? = (q) on the m-divisible group. The 
second of these formulae is standard as mentioned above; cf. for example [Li, 
Th. 3], since pm is not associated to any maximal ideal of level N. For the first 
consider any newform f of level divisible by q and observe that the Petersson 
inner product ( (UU; — 1)f(rz), f(mz)) is zero for any r,m | (Nq/ level f) 
_by [Li, Th. 3]. This shows that UžU¿f(rz), a priori a linear combination of 
f(miz), is equal to f(rz). So UJU} = 1 0n the space of forms on 'y(N,q) 
which are new at q, i.e. the space spanned by forms {f(sz)} where f runs 
through newforms with q | level f. In particular Uj preserves the m-divisible 
group and satisfies the same relation on it, again because pm is not associated 
to any maximal ideal of level N. 7 


Remark 2.9. Assume that pm is of type (A) at q in the terminology of 
Chapter 1, §1 (which ensures that pm does not occur at level N). In this 
case Tm = TH(N, q)m is already generated by the standard Hecke operators 
with the omission of U,. To see this, consider the GL2 (Tm) representation of 
Gal(Q/Q) associated to the m-adic Tate module of Jy (N, q) (cf. the discussion 
following Corollary 2 of Theorem 2.1). Then this representation is already 
defined over the Zp-subalgebra TĦ of Tm generated by the traces of Frobenius 
elements, i.e. by the Ty for £ + Nap. In particular (q) € TĦ. Furthermore, as 
TË is local and complete, and as UŽ = (q), it is enough to solve X 2 = (q) 
in the residue field of Tt. But we can even do this in ko (the minimal field 
of definition of pm) by letting X be the eigenvalue of Frobq on the unique 
unramified rank-one free quotient of kê and invoking the 7, ~ 7(0,) theorem 
of Langlands (cf. [Ca1]). (It is to ensure that the unramified quotient is free 
of rank one that we assume pm to be of type (A).) | 


We assume now that pm is of type (A) at q. Define Sı this time by setting 


Sı = T(N, q)[U1] /Ui(Ui — Ug) S End (Ju(N, @)”) 


where Uj is given by the matrix 


218) o nso 


on Jy(N, q)?. The map és is not necessarily surjective and to remedy this we 
introduce mí?) = m N T? (N,q) where T? (N,q) is the subring of TH(N,q) 
generated by the standard Hecke operators but omitting Ug. We also write md) 
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for the corresponding maximal ideal of TWP (Nq, q?). Then on m®-divisible 
groups, Êz and É> O mą are surjective and we get a natural restriction map of 
localizations Tg (Nq, gi) m0) — Silma): (Note that the image of U, under 
this map is U; and not Ug.) The ideal mı = (m, U1) is maximal in Sı and so also 
in S} (m(a)) and we let mg denote the inverse image of mı under this restriction 
map. The inverse image of m, in Ty(Nq, q?) is also.a maximal ideal which we 
again write mg. Since the completions Ty(N4q, ig oe and Sym, S TH(N, q)m 
are both Gorenstein rings (by Corollary 2 of Theorem 2.1) we can define a 
principal ideal (A,) of T(N, q)m by 


(Aq) = (a 0@) 
where a: Ty(Nq, mg Si ~ T(N, q)m is the restriction map induced 


by the restriction map on m‘%-localizations described above. 


PROPOSITION 2.10. Suppose that m is a mazimal ideal of Ty(N,q) 
associated to an irreducible m of type (A). Then 


(Aa) =(q- 1)? (q+ 1). 


Proof. The method is a straightforward adaptation of that used for Propo- | 
sitions 2.4 and 2.6. We let So = TH(N, q)[U2] / U2(U2 — U,) be the ring of 


endomorphisms of Jy (N, q)? where U2 is given by the matrix 


Uy q 
0 0 


This satisfies the compatibility £;U2 = Ug £3. We define mg = (m, U2) in S2 
and observe that S2,m2 ~ Ty(N,q)m- 
Then we have maps 


Tam (JHN, @)2) TSS Tam (Jur(Na, 9?) ae Tam (JH(N, @)?) 
Ple vo o Teo vy 
Tam(Ja(N, q)) Tam (Jir(N, q)): 


The maps vı and v2 are given by v2: z > (—qz, agz) and vı: z — (z, 0) 
where U; = dg in Ty(N, q)m. One checks then that vj‘ o (£3074) 0(m* 0&3) ovo 
is equal to —(q — 1) (q? — 1) or —3(q— 1)(q? — 1). 

The surjectivity of £307, on the completions is equivalent to the statement 
that | | 

Jy(Nq, q°) [P| mg — Jg(N, q)* [ply 


is surjective. We can replace this condition by a similar one with mla) substi- 
tuted for mg and for mj, i.e., the surjectivity of 


Jy(Nq, q’) [p] mo erR Jg(N, q) [Ph ma) . 
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By our hypothesis that pm be of type (A) at q it is even sufficient to show that 
the cokernel of Jy (Nq, q7)|[p] 8 Fp > Ju (N, q)*[p] $ Fp has no subquotient as 
a Galois-module which is irreducible, two-dimensional and ramified at q. This 
statement, or rather its dual, follows from Lemma 2.5. The injectivity of 7* 0&3 
on the completions and the fact that it has torsion-free cokernel also follows 
from Lemma 2.5 and our hypothesis that pm be of type (A) at q. o 


The case that corresponds to type (B) is similar. We assume in the anal- 
ysis of type (B) (and also of type (C) below) that H decomposes as || H; as 
described at the beginning of Section 1. We assume that m is a maximal ideal 
of Ty(Nq") where H contains the Sylow p-subgroup Sp of (Z/q"Z)* and that 


x Xq 
la 1 


for a suitable choice of basis with xq # 1 and cond Xq = q’. Here q f Np and 
we assume also that pm is irreducible. We use the sa 
(17’)* 0€2 £20 
Ju(Nq") x Ju(Nq") ——> Ju(Nq’,q"*!) ——> mN x Ja (Nq) 


(2.19) "a 


defined analogously to (2.17) where z2 was as defined in Lemma 2.5 and where 
H' is defined as follows. Using the notation H = [[ H; as at the beginning of 
Section 1 set Hi = H; for l # q and H; x Sp = Hy. Then define H’ = |] H; and 
let n’ : Xy (Ng, q+!) —> Xg( Ng, art?) be hé natural map z — z. Using 
Lemma 2.5 we check that £2 is injective on the m()-divisible group. Again we 
set Sı = Ty(Nq") [U1] /U: (U1 — Ug) C End(Jy (Vq")?) where Uj is given by 
the matrix in (2.18). We define mı = (m, U1) and let mg be the inverse image 
of mı in Ty (Nq", g’*'). The natural map (in which Ug — U1) 


Q: Ty (Nd, "aaa oe =R E x TH(Nq' )m 
is surjective by the following remark. 


Remark 2.11. When we assume that pm is of type (B) then the Uq operator 
is redundant in Tm = Ty(Nq")m. To see this, first assume that Tm is reduced 
and consider the GL2(Tm) representation of Gal(Q/ Q) associated to the m- 
adic Tate module. Pick a a, € Iq, the inertia group in D, in Gal(Q/Q), such 
that xg(oq) # 1. Then because the eigenvalues of oq are ee mod m we can 
diagonalize the representation with respect to og. If Frob q is a Frobenius in Dg, 
then in the GL2(T,,) representation the image of Frob q normalizes I4 and we 
can recover U, as the entry of the matrix giving the value of Frobq on the unit 
eigenvector for og. This is by the 7, ~ m(o,) theorem of Langlands as before 
(cf. [Cal]) applied to each of the representations obtained from maps Tm — 
Ofa. Since the representation is defined over the Zp-algebra TË generated by 
the traces, the same reasoning applied to Tt! shows that U, € TH. 
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If T,, is not reduced the above argument shows only that there is an 
operator vg € TĦ such that (U, — vq) is nilpotent. Now Ty(Nq") can be 
viewed as a ring of endomorphisms of S2(['y(Nq")), the space of cusp forms 
of weight 2 on 'y(Nq"). There is a restriction map Ty(Nq") > Ty(Nq")?” 
where T77(Nq")"° is the image of Ty(Nq") in the ring of endomorphisms of 
So(T(NQq"))/S2(Pax(Nq"))°'4, the old part being defined as the sum of two 
copies of S2(T u (Nq"71)) mapped via z — z and z — qz. One sees that on 
m-completions Tm ~ (Ty(Nq")"°”)m since the conductor of pm is divisible by 
q”. It follows that Ug € Tm satisfies an equation of the form P(U,) = 0 where 
P(x) is a polynomial with coefficients in W (km) and with distinct roots. By 
extending scalars to O (the integers of a local field containing W(k,,)) we can 


assume that the roots lie in T Tnu 8 O. 
W (km) 


Since (U, — vq) is nilpotent it follows that P(v,)” = 0 for some r. Then 
since vg € TH which is reduced, P(vg) = 0. Now consider the map T —> IIT,,) 
where the product is taken over the localizations of T at the minimal primes 
p of T. The map is injective since the associated primes of the kernel are all 
maximal, whence the kernel is of finite cardinality and hence zero. Now in 
each Tip), Ug = a; and vg = a; for roots a;, a; of P(x) = 0 because the roots 
are distinct. Since U} — vg € p for each p it follows that a; = a; for each p 
whence Uy = vq in each To). Hence Ug = vq in T also and this finally shows 
that U; € TË in general. 


We can therefore define a principal ideal 
(Aq) = (a o @) 

using, as previously, that the rings Ta: (Nq, q°tt)m, and TH(Nqg")m are Goren- 
stein. We compute (A,) in a similar manner to the type (A) case, but using 
this time that Uj Ug = q on the space of forms on I'y(Nq") which are new at 
q, i.e., the space spanned by forms {f(sz)} where f runs through newforms 
with q” | level f. To see this let f be any newform of level divisible by q” and 
observe that the Petersson inner product ((UjU, —q)f(rz), f (mz) ) = 0 for 
any m | (Nq"/level f) by [Li, Th. 3(ii)]. This shows that (U[U, — q) f (rz), 
a priori a linear combination of {f(m;z)}, is zero. We obtain the following 
result. | | 


PROPOSITION 2.12. Suppose that m is a maximal ideal of Ty(Nq’") | 
associated to an irreducible pm of type (B) atq, i.e., satisfying (2.19) including 
the hypothesis that H contains Sp. (Again q 4 Np.) Then 


(Aq) = ((@- 1)?). 


Finally we have the case where pm is of type (C) at q. We assume then 
that m is a maximal ideal of Ty(Nq") where H contains the Sylow p-subgroup 
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Sp of (Z/q"Z)* and that 
(2.20) H*(Qq, Wy) =0 


where W, is defined as in (1.6) but with pm replacing po, i.e., W) = ad? pm. 
This time we let mg be the inverse image of m in Ty/(Nq") under the 
natural restriction map Ta: (Ng) —> Ty(Nq") with H’ defined as in the 
case of type B. We set 
(A4) = (œo â) 


where a: Ty(Nq")m,>TH(Nq")m is the induced map on the completions, 
which as before are Gorenstein rings. The proof of the following proposition 
is analogous (but simpler) to the proof of Proposition 2.10. (Notice that the 
proposition does not require the condition that pm satisfy (2.20) but this is the 
case in which we will use it.) 


PROPOSITION 2.13. Suppose that m is a mazimal ideal of Tu(N q") asso- 
ciated to an irreducible pm with H containing the Sylow p-subgroup of (Z/q"Z)*. 
Then 

(44) = (q -1). 


Finally, in this section we state Proposition 2.4 in the case q Æ p as this 
will be used in Chapter 3. Let q be a prime, q t Np and let Sı denote the ring 


(221) TH(N) [Ui] AU? - TeV + (a)q} C End(Ja(N)’) 
where ĝ: Jy(N,q) > Jy(N)? is the map defined after (2.10) and Uj is the 


matrix 
| Ty —(q) | 
q 0 j? 


Thus, Uq = Uig. Also (q) is defined as (ng) where ng = g(N), ng = 1(Q). 
Let mı be a maximal ideal of Sı containing the image of m, where m is a 
maximal ideal of T(N) with associated irreducible pm. We will also assume 
that pm(Frobq) has distinct eigenvalues. (We will only need this case and 
it simplifies the exposition.) Let mg denote the corresponding maximal ide- 
als of Ty(N,q) and Ty(Nq) under the natural restriction maps Ty(Nq) > 
TH(N,q) — Sı. The corresponding maps on completions are 


(2.22) Ti(Nq)me > TH(N,q)m 


= Sim = TH(N)m 8 W (kT) 
W (km) 
where kt is the extension of km generated by the eigenvalues of {,,(Frob q)}. 
Thus kt is either equal to km or its quadratic extension. The maps (7, a are 
surjective, the latter because Tq is a trace in the 2-dimensional representation 
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to GLo(TH(N)m) given after Theorem 2.1 and hence is ‘redundant’ by the 
Cebotarev density theorem. The completions are Gorenstein by Corollary 2 to 
Theorem 2.1 and so we define invariant ideals of Sim, 


(2.23) (A)=(a0d),  (A') = (ao 8) o (a8). 

Let aq be the image of U1 in Ty(N)m 7 W (kt) under the last isomorphism 
W (km 

in (2.22). The proof of Proposition 2.4 yields 


PROPOSITION 2.4'. Suppose that pm is irreducible where m is a mazimal 
ideal of Ty(N) and that pm(Frobq) has distinct eigenvalues. Then 


(a? — {q)); 
(az — (g))(q—- 1). 


È 
Il 


(A9 


Remark. Note that if we suppose also that q = 1 (p) then (A) is the unit 
ideal and a is an isomorphism in (2.22). 


3. The main conjectures 


As we suggested in Chapter 1, in order to study the deformation theory 
of po in detail we need to assume that it is modular. That this should always 
be so for det pọ odd was conjectured by Serre. Serre also made a conjecture 
(the ‘e’-conjecture) making precise where one could find a lifting of pọ once 
one assumed it to be modular (cf. [Se]). This has now been proved by the 
combined efforts of a number of authors including Ribet, Mazur, Carayol, 
Edixhoven and others. The most difficult step was to show that if pọ was 
unramified at a prime / then one could find a lifting in which l did not divide 
the level. This was proved (in slightly less generality) by Ribet. For a precise 
statement and complete references we refer to Diamond’s paper [Dia] which 
removed the last restrictions referred to in Ribet’s survey article [Ri3]. The 
following is a minor adaptation of the epsilon conjecture to our situation which 
can be found in [Dia, Th. 6.4]. (We wish to use weight 2 only.) Let N (po) be 
the prime to p part of the conductor of pọ as defined for example in [Se]. 


THEOREM 2.14. Suppose that po is modular and satisfies (1.1) (so in 
particular is irreducible) and is of type D = (-,X,O,M) with - = Se, str or fl. 
Suppose that at least one of the following conditions holds (i) p > 3 or (ii) po 
is not induced from a character of Q(./—3). Then there exists a newform f 
of weight 2 and a prime X of Oy such that psy is of type D’ = (-,,0',M) 
for some O’, and such that (pf mod A) ~ po over Fp. Moreover we can 
assume that f has character xf of order prime to p and has level N (po) po) 
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where (po) = 0 if po|D, is associated to a finite flat group scheme over Zp 


and det po| = w, and 6(p9) = 1 otherwise. Furthermore in the Selmer case 
P 


we can assume that ap( f) = x2(Frobp) mod A in the notation of (1.2) where 
ap( f) is the eigenvalue of Up. 


For the rest of this chapter we will assume that pọ is modular and that 
if p = 3 then po is not induced from a character of Q(./—3). Here and in the 
rest of the paper we use the term ‘induced’ to signify that the representation 
is induced after an extension of scalars to the algebraic closure. | 

For each D = {-,,O,M} we will now define a Hecke ring Tp except 
where - is unrestricted. Suppose first that we are in the flat, Selmer or strict 
cases. Recall that when referring to the flat case we assume that pg is not 
ordinary and that det po| mies Suppose that © = {q;} and that N (po) 


| Ig; ‘ with s; > 0. If Uy ~ k? is the representation space of po We set ng = 
dim; (U))’* where I, is the inertia group at q. Define Mo and M by 


(2.24) Mo=N(oo) [I «a: [[ a, M=Mop™ 
Ng, =2 


qı Mu{p} 


where T(po) = 1 if po is ordinary and 7(p9) = 0 otherwise. Let H be the 
subgroup of (Z/MZ)* generated by the Sylow p-subgroup of (Z/q;Z)* for each 
qi E M as well as by all of (Z/q;Z)* for each q; E€ M of type (A). Let T'y(M) 
denote the ring generated by the standard Hecke operators {T; for l 4 Mp, (a) 
for (a, Mp) = 1}. Let w denote the maximal ideal of T’,,(M) associated to the 
f and À given in the theorem and let kw be the residue field T,(M)/m’. Note 
that m’ does not depend on the particular choice of pair (f, A) in theorem 2.14. 
Then kw œ ko where ko is the smallest possible field of definition for pọ because 
kw is generated by the traces. Henceforth we will identify kg with kw. There 
is one exceptional case where po is ordinary and po|p, is isomorphic to a sum 
of two distinct unramified characters (xı and x2 in the notation of Chapter 1, 
§1). If po is not exceptional we define 


(2.25(a)) Tp = Ty (M) ey O. 


Q 
W (ko) 
If po is exceptional we let T} (M) denote the ring generated by the operators 
{T; for l | Mp, (a) for (a, Mp) = 1, Up}. We choose m” to be a maximal 
ideal of T% (M) lying above m’ for which there is an embedding kw <> k (over 
ko = kw) satisfying Up —> x2(Frobp). (Note that x2 is specified by D.) Then 
in the exceptional case k,,” is either ko or its quadratic extension and we define 
(2.25(b)) Tp = Ty(M)w 7 . (OF 


ml!) 


The omission of the Hecke operators U, for q | Mo ensures that Tp is reduced. 
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We need to relate Tp to a Hecke ring with no missing operators in order 
to apply the results of Section 1. 


PROPOSITION 2.15. In the nonexceptional case there is a maximal ideal 
m for Ty(M) with mN T'y(M) = w and ko = km, and such that the natural 
map Ty(M) qv —> Ty(M)m is an isomorphism, thus giving 


W (ko) 


In the exceptional case the same statements hold with m” replacing m’, T7,(M) 
replacing T4,(M) and kw replacing ko. 


Proof. For simplicity we describe the nonexceptional case indicating where 
appropriate the slight modifications needed in the exceptional case. To con- 
struct m we take the eigenform fo obtained from the newform f of Theorem 2.14 
by removing the Euler factors at all primes q € & — {M Up}. If po is ordinary 
and f has level prime to p we also remove the Euler factor (1 — 3,-p~*) where 
Bp is the non-unit eigenvalue in Oy). (By ‘removing Euler factors’ we mean 
take the eigenform whose L-series is that of f with these Euler factors re- — 
moved.) Then fo is an eigenform of weight 2 on T y(M) (this is ensured by the 
choice of f) with O; coefficients. We have a corresponding homomorphism 
Tf: TH(M) — Of, and we let m = T7 (A). | 

Since the Hecke operators we have used to generate T’,,(M) are prime to 
the level there is an inclusion with finite index 


Ti (M)—> |] Og 


where g runs over representatives of the Galois conjugacy classes of newforms 
associated to l'y (M) and where we note that by multiplicity one O, can also be 
described as the ring of integers generated by the eigenvalues of the operators 
in Ty(M) acting on g. If we consider Ty(M) in place of T4,(M) we get a 
similar map but we have to replace the ring O, by the ring 


Sg = Og|Xqis-- +> Xan: Xp) / {Yi Zp hi=l 
where {p, qi,--..,@,r} are the distinct primes dividing Mp. Here 
(2.26) y. = A (Xa — Qq; (9)) (Xa — Bq, (9)) if q; { level(g) 

AG; (Xa — Ag, (9)) if q | level(g), 


where the Euler factor of g at q; (ie., of its associated L-series) is 
(1—aq,(9)9; °?) (1— Pa. (9)q; °) in the first case and (1— ag, (g)q; °) in the second 
case, and q;‘|| (m / level(g)). (We allow ag,(g) to be zero here.) Similarly Zp is 
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defined by 
X2 —ap(9)Xp+pxg(p) ifp| M, p t level(g) 
Zp = $ Xp — ap(Q) if p + M 
Xp — ap(g) if p | level(g), 
where the Euler factor of g at p is (1 — ap(g)p-* + xg(p) p' °°) in the first 


two cases and (1 — a,(g)p~*) in the third case. We then have a commutative 
diagram 


Ty(M) Cœ | II, O 


(2.27) f 


TH(M) C= 11 Sy = [1 OolXa -+ Xar Xpl/ (Yis Zoia 


where the lower map is given on {Uy,, Up or Tp} by Uy — Xy, Up or 
T, — Xp (according as p | M or p 4 M). To verify the existence of such 
a homomorphism one considers the action of Ty(M) on the space of forms of 
weight 2 invariant under I'y7(M) and uses that )"_) 9;(m,z) is a free gener- 
ator as a Ty(M) @ C-module where {g;} runs over the set of newforms and 
= M/level(g;). | 

Now we tensor all the rings in (2.27) with Z,. Then completing the top 
row of (2.27) with respect to mw and the bottom row with respect to m we get 
a commutative diagram 


Ty Mow —+ (Cs) = Y One 


(2.28) | | | 


m! — ys 
TH(M)m — (IIS), = So) 
Here u runs through the or above p in each Ox for which m’ > u under 
Ty (M) > Og. Now (Sg)m is given by 
(2.29) (Sg@Zp)m ~ ((Og 8 Zp) [Xa -Xa Xpl /{¥is Zp} 


(1 Og gum | Xq». A ges Xp) / {Yi, Zain) 


ulp 


0 7 


where Ag „ denotes the product of the factors of the complete semi-local ring 
Oil CAERE, CES. E1 / {Y;, Zp}%_, in which Xq, is topologically nilpotent for 


K 


m 


I 
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qi £ M and in which X, is a unit if we are in the ordinary case (i.e., when 
p | M). This is because Us, € m if q; € M and U, is a unit at m in the ordinary 
case. 

Now if m’ — u then in (Ag.,)m we claim that Y; is given up to a unit by 
Xq, — bi for some b; E€ Og, with b; = 0.if q; € M. ery Zp is given up to.a 
unit by Xp — ap(g) where ap(g) is the unit root of z? — a + pxX9(p) = 0 in 
Ogu if p t level g and p | M and by Xp — ap(g) if p | level g or p 4 M. This 
will show that (Ag, u)m ~ Og, when m’ > u and (Ag, p)m = 0 otherwise. 

For q; € M and for p, the claim is straightforward. For q; ¢ M, it amounts 
to the following. Let U,,,, denote the 2-dimensional K,,,-vector space with 
Galois action via pg,, and let ng,(g, 4) = dim(U,,,,)’%. We wish to check that 
Y; = unit. Xq; in (Ag,,)m and from the definition of Y; in (2.26) this reduces 
to checking that r; = ng; (g, 4) by the tg =~ m(o,) theorem (cf. [Cal]). We use 
here that a9;(g), Bq: (g) and aq; (g) are p-adic units when they are nonzero since 
they are eigenvalues of Frobq;. Now by definition the power of q; dividing M 
_ is the same as that dividing N (00) 45" (cf. (2.21)). By an observation of Livné 
(cf. [Liv], [Ca2, §1]), 


(2.30) ord; (level g) = ordą, (N (p0)q;% ™“ (944) ) 


As by definition q;*||(M/levelg) we deduce that r; = ng; (g, 4) as required. 
We have now shown that each Ay, ~ O,,, (when w —> p) and it follows 
from (2.28) and (2.29) that we have homomorphisms 


Ta(M)w — TH(M)m — [[ Onn 
| P J u 

where the inclusions are of finite index. Moreover we have seen that Uy, = 0 
in Ty(M )m for qi ¢ M. We now consider the primes q; € M. We have 
to show that the operators U, for q E€ M are redundant in the sense that 
they lie in Ty (M)w, i.e., in the Zy-subalgebra of T7(M),_ generated by the 
{Tj}: | 4 Mp, (a): a € (Z/MZ)* }. For q € M of type (A), Ug € Ty (M) a 
as explained in Remark 2.9 and for q E€ M of type (B), U, € va) )w as 
explained in Remark 2.11. For q E€ M of type (C) but not. of type (A), U. ea 0 
by the mg ~ (oq) theorem (cf. [Cal]). For in this case ng = 0 whence also 
Nq(g, u) = 0 for each pair (g, u) with m’ — u. If po is strict or Selmer at p then 
Up can be recovered from the two-dimensional representation p (described after 
the corollaries to Theorem 2.1) as the eigenvalue of Frob p on the (free, of rank 
one) unramified quotient (cf. Theorem 2.1.4 of [Wil]). As this representation 
is defined over the Z,-subalgebra generated by the traces, it follows that Up 
is contained in this subring. In the exceptional case U, is in Ti (M) ma by 
definition. 

Finally we have to show that Tp is also redundant in the sense explained 
above when p ł M. A proof of this has already been given in Section 2 (Ribet’s 
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lemma). Here we give an alternative argument using the Galois representa- 
tions. We know that Tp € m and it will be enough to show that Tp € (m7, p). 
Writing km for the residue field Ty(M),/m we reduce to the following situa- 
tion. If T, ¢ (m?, p) then there is a quotient 


TH(M)m/(m”, p)>kmle] = TH(M)m/a 
where km{é] is the ring of dual numbers (so £? = 0) with the property that 


T, + re with À Æ 0 and such that the image of T'y(M)w lies in km. Let G/q 
denote the four-dimensional k,,-vector space associated to the representation 


pe: Gal(Q/Q) — GL2 (kmle]) 


induced from the representation in Theorem 2.1. It has the form 
GQ ~ Gog p Gog 


where Go is the corresponding space associated to po by our hypothesis that 
the traces lie in km. The semisimplicity of G/q here is obtained from the main 
theorem of [BLR]. Now Gyq, extends to a finite flat group scheme G/z,,. 
Explicitly it is a quotient of the group scheme Jy(M) m[p|/z,- Since extensions 
to Zp are unique (cf. [Ray1]) we know | 


G SG G ' 
E /Zp 0/ Zp B Go/ Zp 
Now by the Eichler-Shimura relation we know that in Jy (M)/p 
Pp 


T, = F + (p}F". 


Since T, € m it follows that F + (p) FT = 0 on Go IF and hence the same holds 
on Gf,- But Tp is an endomorphism of G [Zo which is zero on the special 
fibre, so by [Rayl, Cor. 3.3.6], Tp = 0 on G7 z,. It follows that Tp = 0 in kmlE] 
which contradicts our earlier hypothesis. So Tp € (m?, p) as required. This 
completes the proof of the proposition. o 


From the proof of the proposition it is also clear that m is the unique max- 
imal ideal of Ty(M) extending w and satisfying the conditions that Uq E€ m 
for q € E — {MU p} and Up ¢ m if po is ordinary. For the rest of this chapter 
we will always make this choice of m (given po). 

Next we define Tp in the case when D = (ord, £, O, M). If n is any 
ordinary maximal ideal (i.e. Up n) of Ty(Np) with N prime to p then Hida 
has constructed a 2-dimensional Noetherian local Hecke ring 


To =e Tu(Np”)n := lim e TH(Np")n, 


which is a A = Z,|[T]-algebra satisfying T../T ~ Ty(Np)n. Here n, is the 
inverse image of n under the natural restriction map. Also T = lim (1 +Np)-1 
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and e = lim U: r! For an irreducible pọ of type D we have defined Tp in 
(2.25(a)), wiere D' = (Se, £, O, M), b 
Tp = atk & O, 
W (km) 


m 


the isomorphism coming from Proposition 2.15. We will define Tp by 


(2.31) Tp = eT y(Mop™) m & O. 
W (km) 


In particular we see that 
(2.32) Tp/T ~ Tp, 


i.e., where D’ is the same as D but with ‘Selmer’ replacing ‘ord’. Moreover if 
q is a height one prime ideal of Tp containing (a +T” - (14N pee) 
for any integers n > 0, k > 2, then Tp/q is associated to an eigenform in a 
natural way (generalizing the case n = 0, k = 2). For more details about these 
rings as well as about A-adic modular forms see for example [Wil] or [Hi1]. 

For each n > 1 let Tan = Ty(Mop")m,. Then by the argument given 
after the statement of Theorem 2.1 we can construct a Galois representation pn 
unramified outside Mp with values in GL2(T,,) satisfying trace p,(Frob1) = Th, 
det p,(Frobl) = (l) for (l; Mp) = 1. These representations can be patched 
together to give a continuous representation 


(2.33) p = lim pp: Gal(Qn/Q) — GL2(Tp) 


where » is the set of primes dividing Mp. To see this we need to check the 
commutativity of the maps 


Ry ———> T, 


N| 


Tr-1 


-where the horizontal maps are induced by pn and pn—1 and the vertical map is 
- the natural one. Now the commutativity is valid on elements of Ry, which are 
traces or determinants in the universal representation, since trace (Frob1) +> T, 
under both horizontal maps and similarly for determinants. Here Ry is the 
universal deformation ring described in Chapter 1 with respect to po viewed 
with residue field k = ky. It suffices then to show that Ry is generated (topo- 
logically) by traces and this reduces to checking that there are no nonconstant 
deformations of po to k[e] with traces lying in k (cf. [Ma1, §1.8]). For then if RE 
denotes the closed W(k)-subalgebra of Ry generated by the traces we see that 
R€ — (Ry/m*) is surjective, m being the maximal ideal of Rs, from which 
we easily conclude that R€! = Ry. To see that the condition holds, assume 
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that a basis is chosen so that po (c) = = G aa) for a chosen complex conjugation 
c and polo) = ey 2) with bs = 1 and cg Æ 0 for some ø. (This is possible 
because po is irreducible.) Then any deformation [p] to k[e] can be represented 
by a representation p such that p(c) and p(o) have the same properties. It 
follows easily that if the traces of p lie in k then p takes values in k whence 
it is equal to po. (Alternatively one sees that the universal representation can 
be defined over RË by diagonalizing complex conjugation as before. Since the 
two maps R= — T,-1 induced by the triangle are the same, so the associ- 
ated representations are equivalent, and the universal property then implies 
the commutativity of the triangle.) | 
| The representations (2.33) were first exhibited by Pida and were the orig- 
inal inspiration for Mazur’s deformation theory. 
For each D = {-,4,0,M} where - is not ure cted there is then a- 
canonical surjective map 
yp: Rp > Tp 


which induces the representations described after the corollaries to Theorem 2.1 
and in (2.33). It is enough to check this when O = W (ko) (or W (kw) in the 
exceptional case). Then one just has to check that for every pair (g, p) which 
appears in (2.28) the resulting representation is of type D. For then we claim 
that the image of the canonical map Rp — Tp = 110, is Tp where here ~ 
denotes the normalization. (In the case where - is ord this needs to be checked 
instead for T, a O for each n.) For this we just need to see that Rp is 


generated by traces. (In the exceptional case we have to show also that Up is 
in the image. This holds because it can be identified, using Theorem 2.1.4 of 
[Wil], with the image of u € Rp where u is the eigenvalue of Frobp on the 
unique rank one unramified quotient of R32, with eigenvalue = x2(Frob p) which 
is specified in the definition of D.) But we saw above that this was true for 
Ry. The same then holds for Rp as Rg —> Rp is surjective because the map 
on reduced cotangent spaces is surjective (cf. (1.5)). To check the condition 
on the pairs (g, p) observe first that for q E€ M we have imposed the following 
conditions on the level and character of such g’s by our choice of M and H: 


q of type (A): q|| level g, det Powel =l; 
q 
q of type (B): cond xg|| level g, det Pail, = Xq 
q 
q of type (C): det Poul is the Teichmüller lifting of det po : 
q q 


In the first two cases the desired form of py, alt then follows from the 


Tg (oq) theorem of Langlands (cf. [Cal]). The third case is already of 
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type (C). For q = p one can use Theorem 2.1.4 of [Wil] in the ordinary case, 
the flat case being well-known. 

The following conjecture generalizes a fundamental conjecture of Mazur 
and Tilouine for D = (ord, X, W (ko), $); cf. [MT]. 


CONJECTURE 2.16. yp is an isomorphism. 


Equivalently this conjecture says that the representation described after 
the corollaries to Theorem 2.1 (or in (2.33) in the ordinary case) is the universal 
one for a suitable choice of H, N and m. We remind the reader that throughout 
this section we are assuming that if p = 3 then po is not induced from a 


character of Q(./—3). 


Remark. The case of most interest to us is when p = 3 and pọ is a rep- 
resentation with values in GL2(F3). In this case it is a theorem of Tunnell, 
extending results of Langlands, that pọ is always modular. For GL2(F3) is a 
double cover of S4 and.can be embedded in GLo(Z[/—2]) whence in GL2(C)); 
cf. [Se] and [Tu]. The conjecture will be proved with a mild restriction on pọ 
at the end of Chapter 3. 


Remark. Our original restriction to the types (A), (B), (C) for po was. 
motivated by the wish that the deformation type (a) be of minimal conduc- 
tor among its twists, (b) retain property (a) under unramified base changes. 
Without this kind of stability it can happen that after a base change of Q to an 
extension unramified at X, pọ ® w has smaller ‘conductor’ for some character 
p. The typical example of this is where po| = Ind}? (x) with q = —1(p) and 


x is a ramified character over K, the unramified quadratic extension of Q4. 
What makes this difficult for us is that there are then nontrivial ramified local 
deformations (Ind,*?€ for € a ramified character of order p of K) which we 
cannot detect by a change of level. 


For the purposes of Chapter 3-it is convenient to digress now in order to 
introduce a slight variant of the deformation rings we have been considering 
so far. Suppose that D = (-, X, ©, M) is a standard deformation problem 
(associated to po) with - = Se, str or fl and suppose that H, Mo, M and m 
are defined as in (2.24) and Proposition 2.15. We choose a finite set of primes 
Q = {q,...,¢r} with q; { Mp. Furthermore we assume that each q; = 1(p) 
and that the eigenvalues {a;, 8i} of po(Frobq;) are distinct for each q; E€ Q. 
This last condition ensures that pp does not occur as the residual representation 
of the A-adic representation associated to any newform on [y(M,q..... qr) 
where any q; divides the level of the form. This can be seen directly by looking 
at (Frobq;) in such a representation or by using Proposition 2.4’ at the end of 
Section 2. It will be convenient to assume that the residue field of O contains 
Qi, Bi for each qi. 
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Pick a; for each i. We let Dg be the deformation problem associated to 
_representations p of Gal(Qsug/Q) which are of type D and which in addition 
satisfy the property that at each q; E€ Q 


X1, i i 
(2.34) Ply ~ ( a 


with x29, unramified and x2,¢,(Frobq;) = a; mod m for a suitable choice of 
basis. One checks as in Chapter 1 that associated to Dg there is a universal 
deformation ring Rg. (These new conditions are really variants on type (B).) 

‘We will only need a corresponding Hecke ring in a very special case and it 
is convenient in this case to define it using all the Hecke operators. Let us now 
set N = N(po)p°\) where 5(po) is as defined in Theorem 2.14. Let mg denote 
a maximal ideal of Ty(N) given by Theorem 2.14 with the property that 
Pm œ po over F, relative to a suitable embedding of km, — k over ko. (In the 
exceptional case we also impose the same condition on mg about the reduction 
of U, as in the definition of Tp in the exceptional case before (2.25)(b).) Thus 
Pmo Pf,» mod A over the residue field of Of) for some choice of f and A 
with f of level N. By dropping one of the Euler factors at each q; as in the 
proof of Proposition 2.15, we obtain a form and hence a maximal ideal mg of 
TH(Nq ...qr) with the property that pmo ~ po over F, relative to a suitable 
embedding kmo — k over km. The field kmo is the extension of ko (or kw” in 
the exceptional case) generated by the a;, Bi. We set 


(2.35) To =TuH(Na-.-dr)mo ® 


” W (king ) 
It is easy to see directly (or by the arguments of Proposition 2.15) that 
Tg is reduced and that there is an inclusion with finite index 


(2.36) : To = To = I] Ogu 


where the product is taken over representatives of the Galois conjugacy classes 
of eigenforms g of level Nqi...q- with mg — u. Now define Dg using the 
choices a; for which U,, —> a; under the chosen embedding kmo — k. Then 
- each of the 2-dimensional representations associated to each factor Og, is of 
type Dg. We can check this for each q E€ Q using either the 7, ~ n(oq) 
theorem (cf. [Cal]) as in the case of type (B) or using the Eichler-Shimura 
relation if g does not divide the level of the newform associated to g. So we get 
a homomorphism of O-algebras Rg —> To and hence also an O-algebra map 


(2.37) - pQ: Ro > Ta 


as Rg is generated by traces. This is not an isomorphism in general as we 
have used N in place of M. However it is surjective by the arguments of 
Proposition 2.15. Indeed, for q | N(po)p, we check that Ug is in the image of 
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yg using the arguments in the second half of the proof of Proposition 2.15. 
For q E€ Q we use the fact that U, is the image of the value of x2,,.(Frob q) 
in the universal representation; cf. (2.34). For q | M, but not of the previous 
types, Tq is a trace in pp, and we can apply the Cebotarev density theorem 
to show that it is in the image of pQ. 

Finally, if there is a section 7: Tg — O, then set po = kerm and let pp de- 
note the 2-dimensional representation to GL2(O) obtained from py, mod pq. 
Let V = Ad ø g K/O where K is the field of fractions of O. We pick a basis 


for pp Satisfying (2.34) and then let 


(2.38) ya) G >) 
a b 
i: i )iahadeo}eK/o 


and let Vig.) = V/ V(%), Then as in Proposition 1.2 we have an isomorphism 


(2.39) Homo (prg/Pig> K/O) ~ Hbo (Q2ug/Q, V) 


where pro = ker(m o pq) and the second term is defined by 


C Ad pp &K/O 


(2.40) Hp, (Qzug/Q, V) = ker: Hp(Qzug/Q,V) > I] H’? (Q> Via): 


We return now to our discussion of Conjecture 2.16. We will call a de- 
formation theory D minimal if & = M U {p} and - is Selmer, strict or flat. 
This notion will be critical in Chapter 3. (A slightly stronger notion of mini- 
mality is described in Chapter 3 where the Selmer condition is replaced, when 
possible, by the condition that the representations arise from finite flat group 
schemes—see the remark after the proof of Theorem 3.1.) Unfortunately even 
up to twist, not every po has an associated minimal D even when fp is flat or 
ordinary at p as explained in the remarks after Conjecture 2.16. However this 
could be achieved if one replaced Q by a suitable finite extension depending 
on po. | | 
Suppose now that f is a (normalized) newform, AÀ is a prime of Of above p 
and pf, a deformation of po of type D where D = (-, 5, Ofa, M) with - = Se, 
str or fl. (Strictly speaking we may be changing pọ as we wish to choose its 
field of definition to be k = O;,/X.) Suppose further that level(f) | M where 
M is defined by (2.24). | 

Now let us set O = Of for the rest of this section. There is a homomor- 
phism | 


(2.41) | T =rpf:Tp> 0 
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whose kernel is the prime ideal py ¢ associated to f and à. Similarly there is 


a homomorphism 
| Rp — O 


whose kernel is the prime ideal pr associated to f and A and which factors 
through wf. Pick perfect pairings of O-modules, the second one T’p-bilinear, 


(2.42) OxO-0, (,):TpxTp—-O. 


In each case we use the term perfect pairing to signify that the pairs of induced 
maps O — Homo (O, O) and Tp — Homo (Tp, ©) are isomorphisms. In 
addition the second one is required to be T:p-linear. The existence of the second 
pairing is equivalent to the Gorenstein property, Corollary 2 of Theorem 2.1, 
as we explain below. Explicitly if h is a generator of the free Tp-module 
Homo(Tp, O) we set (ty, t2) - h(tıt2). 

A priori Ty(M)m (occurring in the description of Tp in Proposition 2.15) 
is only Gorenstein as a Z,-algebra but it follows immediately that it is also a 
Gorenstein W(k,)-algebra. (The notion of Gorenstein O-algebra is explained 
in the appendix.) Indeed the map 


Homy(i,) (TH(M)m, W(km)) — Homz, (TH(M)m, Zp) 


given by y + traceoy is easily seen to be an isomorphism, as the reduction 
mod p is injective and the ranks are equal. Thus Tp is a Gorenstein O-algebra. 

Now let # : O — Tp be the adjoint of 7 with respect to these pairings. 
Then define a principal ideal (7) of Tp by 


(n) = (np,5) = (#1). 
This is well-defined independently of the pairings and moreover one sees that 
Tp/n is torsion-free (see the appendix). From its description (7) is invariant 
under extensions of O to O’ in an obvious way. Since Tp is reduced m(n) Æ 0. 
One can also verify that | 


(2.43) n(n) = (n,n) 


up to a unit in O. 

We will say that Dı D D if we obtain Dı by relaxing certain of the 
hypotheses on D, i.e., if D = (, X, O, M) and Di = (-, X1, O1, M1) we allow 
that 5; D E, any O1, M D M, (but of the same type) and if - is Se or str 
in D it can be Se, str, ord or unrestricted in Dj, if - is fl in D, it can be fl 
or unrestricted in Dı. We use the term restricted to signify that - is Se, str, 
fl or ord. The following theorem reduces conjecture 2.16 to a ‘class number’ 
criterion. For an interpretation of the right-hand side of the inequality in 
the theorem as the order of a cohomology group, see Proposition 1.2. For an 
interpretation of the left-hand side in terms of the value of an inner product, 
see Proposition 4.4. 
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THEOREM 2.17. Assume, as above, that pf is a deformation of po of 
type D = (X, O = Oy, M) with - = Se, str or fl. Suppose that 
#O/n(np,5) = #Prz/PRz- 
Then | 
(i) yp,:Rp, ~ Tp, is an isomorphism for all (restricted) D, D D. 


li) Tp, ts a complete intersection (over O, if - is Se, str or fl) for all re- 
1 
stricted Dı D D. 


Proof. Let us write T for Tp, pp for PT, fo PR for pry and 7 for np Fz. 
Then we always have 


(2.44) #O/n < #pr/py. 

(Here and in what follows we sometimes write 7 for m(n) if the context makes 
this reasonable.) This is proved as follows. T/n acts faithfully on pp. Hence 
the Fitting ideal of pp as a T/n-module is zero. The same is then true of 
pr/p, as an O/n = (T/n)/pr-module. So the Fitting ideal of pp/p2 as an 
O-module is contained in (7) and the conclusion is then easy. So together with 
the hypothesis of the theorem we get inequalities (and hence equalities) 


# O/n(n) > #pr/Ph > #pr/p> > #O/n(n). 
_ By Proposition 2 of the appendix T is a complete intersection over O. Part (ii) 
of the theorem then follows for D. Part (i) follows for D from Proposition 1 of 


the appendix. 
We now prove inductively that we can Jedi the same inequality 


(2.45) F Oı/nD,,f = z HPR, f/ PR f 
for Dı D D and Rı = Rp,. The above argument will then prove the theorem 
for Dı. We explain this first in the case Dı = D, where D, differs from D only 
in replacing & by UU{q}. Let us write T, for Tp,, PR q for pry with R = Rp, 
and nq for np,,f- We recall that U; = 0 in Ty. 

We choose isomorphisms 
(2.46) T ~ Homo(T, ©), T, ~ Homo(T,, O) 
coming from the fact that each of the rings is a Gorenstein O-algebra. If 
Qq: Tg —> T is the natural map we may consider the element A, = a,od, € T 
where the adjoint is with respect to the above isomorphisms. Then it is clear 
that 


2A) (a24(nq)) = (nq) 


as principal ideals of T. In particular m(n) = t(n Aq) in O. 
Now it follows from Proposition 2.7 that the principal ideal (Ag ) is given by 


(2.48) (Aq) = ((g— 1)°(T? - (q)(1+-@))). 
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In the statement of Proposition 2.7 we used Zp-pairings 
T ~ Homz, (T, Zp), T, ~ Homz, (T4, Zp) 


to define (A4) = (aqo âq). However, using the description of the pairings 
as W (km)-algebras derived from these Zņp-pairings in the paragraph following 
(2.42) we see that the ideal (Aq) is unchanged when we use W (km)-algebra 
pairings, and hence also when we extend scalars to O as in (2.42). 

On the other hand 


Hpng/Pha < #pr/Ph: # 10/0- 1) (T2- (00 +a)?)} 


by Propositions 1.2 and 1.7. Combining this with (2.47) and (2.48) gives (2.45). 

If M Æ ¢ we use a similar argument to pass from D to Dg where this time 
D, signifies that D is unchanged except for dropping q from M. In each of 
types (A), (B), and (C) one checks from Propositions 1.2 and 1.8 that 


Hprq/Paq < #PR/PR: #H’(Qy, V*). 


This is in agreement with Propositions 2.10, 2.12 and 2.13 which give the 
corresponding change in 7 by the method described above. 

To change from an O-algebra to an Oj-algebra is straightforward (the - 
complete intersection property can be checked using [Kul, Cor. 2.8 on p. 209)), 
and to change from Se to ord we use (1.4) and (2.32). The change from str 
to ord reduces to this since by Proposition 1.1 strict deformations and Selmer 
deformations are the same. Note that for the ord case if R is a local Noetherian. 
ring and f € R is not a unit and not a zero divisor, then R is a complete 
intersection if and only if R/f is (cf. [BH, Th. 2.3.4]). This completes the 
proof of the theorem. | EJ 


Remark 2.18. If we suppose in the Selmer case that f has level N with 
p + N we can also consider the ring Ty(Mo)my (with Mo as in (2.24) and mo 
defined in the same way as for Ty(M)). This time set 


T =To(M © O, T=TyH(M)m Q O. 
0 AMO) wi o) H( m 2 


Define 7,17, po and p with respect to these rings, and let (Ap) = Qp ° Ĝp where 
æp : T — To and the adjoint is taken with respect to O-pairings on T and To. 
We then have by Proposition 2.4 


(2.49) (np) = (n- Ap) = (1: (T2 — (p)\(1 +p)*)) = (n: (a2 — (p))) 
as principal ideals of T, where ap is the unit root of z? — Tpz + p(p) = 0. 


Remark. For some earlier work on how deformation rings change with & 
see [Bo]. | 
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Chapter 3 


In this chapter we prove the main results about Conjecture 2.16. We 
begin by showing that the bound for the Selmer group to which it was reduced 
in Theorem 2.17 can be checked if one knows that the minimal Hecke ring 
is a complete intersection. Combining this with the main result of [TW] we 
complete the proof of Conjecture 2.16 under a hypothesis that ensures that a 
minimal Hecke ring exists. 


Estimates for the Selmer group 


Let po: Gal(Qs/Q) — GL2(k) be an odd irreducible representation which 
we will assume is modular. Let D be a deformation theory of type (-, 4,0, M) 
such that po is type D, where - is Selmer, strict or flat. We remind the 
reader that k is assumed to be the residue field of O. Then as explained in 
Theorem 2.14, we can pick a modular lifting pz of po of type D (altering 
k if necessary and replacing O by a ring containing Oş) provided that po 
is not induced from a character of Q(./—3) if p = 3. For the rest of this 
chapter, we will make the assumption that po is not of this exceptional type. 
Theorem 2.14 also specifies a certain minimum level and character for f and 
in particular ensures that we can pick f to have level prime to p when po|p, 
is associated to a finite flat group scheme over Zp and det po|z, = w. 

In Chapter 2, Section 3, we defined a ring Tp associated to D. Here we 
make a slight modification of this ring. In the case where - is Selmer and 
pol D, 18 associated to a finite flat group scheme and det pol Ip = w we set 


(3.1 Tp, = Thy (Mo)u 2 
(83.1) Do = Ti (Mo) 2, 


with Mo as in (2.24), H defined following (2.24) (it is actually a subgroup 
of (Z/Mo Z)*) and mp the maximal ideal of T'y(Moọo) associated to pọ. The 
same proof as in Proposition 2.15 ensures that there is a maximal ideal mọ of 
TH(Mo) with mp N Tg (Mo) = m and such that the natural map 


(3.2) Tp) = Ty(Mo)m 9 O-TH(Mo)m 9 O 


W (ko) W (ko) 
= is an isomorphism. The maximal ideal mọ which we choose is characterized by 
the properties that pmo = pp and U; € mọ for q E X — M U {p}. (The value of 
Tp or of U, for q E€ M is determined by the other operators; see the proof of 
Proposition 2.15.) We now define Tp, in general by the following: 
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Tp, is given by (3.1) if: is Se and po|p, is associated 
| to a finite flat group scheme over Z, and 
det poli, = w; 


(3.3) 


Ip, =Tp if- is str or fl, or po| D, is not associated 
to a finite flat group scheme over Zp, or 
det polz, 4 w. 


We choose a pair (f,A) of minimum level and character as given by Theo- 
rem 2.14 and this gives a homomorphism of O-algebras_ 


Tf: Tp, >02 Of. 


We set pp f = ker mp and similarly we let p R,f denote the inverse image of PT f 
in Rp. We define a principal ideal (np f) of Tp, by taking an adjoint i» of np 
with respect to pairings as in (2.42) and write 


ny ,¢ = (îs (1)). 


Note that pr f/pī p is finite and 7; (nr f) # 0 because Tp, is reduced. We 
also write ny s for nf(mr,p) if the context makes this usage reasonable. We let 
Vi = Ad pp Q K/O where py is the extension of scalars of p fr to O. 


THEOREM 3.1. Assume that D is minimal, i.e., Y = MU {p}, and that 


po is absolutely irreducible when restricted to Q (-1) 7 p). Then 


(i) #Hp(Qz/Q,Vz) < #(pa,z/PH 5)? &/#0O/nr,) 


where Cp = #(0/U? — (p)) < œo when po is Selmer and po| D, 18 associated to 
a finite flat group scheme over Zp and det po|1, =w, and cp = 1 otherwise; 

(ii) if Tp, is a complete intersection over O then (i) is an equality, Rp ~ 
Tp and Tp is a complete intersection. | 


In general, for any (not necessarily minimal) D of Selmer, strict or flat 
type, and any pf \ of type D, #H4(Qs/Q,V;) < œ if po is as above. 


Remarks. The finiteness was proved by Flach in [Fl] under some restric- 
tions on f, p and D by a different method. In particular, he did not consider 
the strict case. The bound we obtain in (i) is in fact the actual order of 
H5(Qs/Q, Vf) as follows from the main result of [TW] which proves the hy- 
pothesis of part (ii). Then applying Theorem 2.17 we obtain the order of this 
group for more general D’s associated to pọ under the condition that a minimal 
D exists associated to pp. This is stated in Theorem 3.3. 
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The case where the projective representation associated to po is dihedral 
does not always have the property that a twist of it has an associated minimal 
D. In the case where the associated quadratic field is imaginary we will give a 
different argument in Chapter 4. | 


Proof. We will assume throughout the proof that D is minimal, indicating 
only at the end the slight changes needed for the final assertion of the theorem. 
Let Q be a finite set of primes disjoint from È satisfying q = 1(p) and po(Frob q) 
having distinct eigenvalues for each q € Q. For the minimal deformation 
problem D = (-, 4,0, M), let Dg be the deformation problem described before 
(2.34); i.e., it is the refinement of (-, U Q, O, M) obtained by imposing the 
additional restriction (2.34) at each q € Q. (We will assume for the proof that 
O is chosen so O/A = k contains the eigenvalues of po(Frob q) for each q € Q.) 
We set 

T = To,, R = Rp 
and recall the definition of Tg and Rg from Chapter 2, §3 (cf. (2.35)). We 
write V for Vy and recall the definition of V(,) following (2.38). Also remember 
that mg is a maximal ideal of Ty(Nqi...q,) as in (2.35) for which pmo ~ po 
over F, (recall that this uses the same choice of embedding kmo —> k as in 
the definition of Tg). We use mg also to denote the maximal ideal of Tg if 
the context makes this reasonable. 

Consider the exact and commutative dees 


6 unr 
0 — Hp(Q:/Q,V) — Hb (Qsug/QV) > [] mag, vono 
qEQ 


0 — (Pr/p2)* ae (PRo Pro)” | ie 


f T 
0 — (PTPR — PTPR) = Koo 
where Kg is-by definition the cokernel in the horizontal sequence and * denotes 
Homo( , K/Q) for K the field of fractions of O. The key result is: 


LEMMA 3.2. The map lg is injective for any finite set of primes Q 
satisfying E 
q = 1(p), st # (q) (1+)? mod m for allg E€ Q. 


Proof. Note that the hypotheses of the lemma ensure that p9(Frob q) has 
distinct eigenvalues for each q € Q. First, consider the ideal ag of Ro defined 
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by 


(3.4) sazla: — 1, b;, c;d; — 1: (; )= proto: with o; E Iq,5 4 € o} 


Then the universal property of Rg shows that Rg/ag ~ R. This permits us 
to identify (pr/p?,)* as 


(pr/pp)* = {f € (PRo/PRo)* : Flag) = 0}. 


If we prove the same relation for the Hecke rings, i.e., with T and Tg replacing 
R and Rg then we will have the injectivity of ug. We will write ag for the 
image of ag in Tg under the map pQ of (2.37). 

It will be enough to check that for any q E€ Q’, Q’ a subset of Q, Tq: /tg > 
Tq} where ag is defined as in (3. 4) but with Q replaced by q. Let 
N’ = N (po) p®?0) - Il,,eq’—{q} gi Where 6(p0) is as defined in Theorem 2.14. 
Then take an element o € I, C Gal(Q,/Q,) which restricts to a generator 
of Gal(Q(¢n7q)/Q(Cn’)). Then det(o) = (tg) € Tq in the representation to 
GL2(Tg’) defined after Theorem 2.1. (Thus tg = 1(N’) and fg is a primitive 
root mod q.) It is easily checked that | 


(3.5) Jy (N", q)mgy (Q) = JH(N'G) mgr (Q) [ (tq) a 1). 


Here H is still a subgroup of (Z/MoZ)*. (We use here that po is not reducible 
for the injectivity and also that po is not induced from a character of Q(./—3) 
for the surjectivity when p = 3. The latter is to avoid the ramification points of © 
the covering X(N'q) —> Xy(N’,q) of order 3 which can give rise to invariant 
divisors of X7(N’q) which are not the images of divisors on Xg (N’,q).) 

Now by Corollary 1 to Theorem 2.1 the Pontrjagin duals of the modules 
in (3.5) are free of rank two. It follows that 


(3.6) (TH(N’q) mgr)? /((ta) — 1) = (THIN, ange)”: 


_ The hypotheses of the lemma imply the condition that po(Frob q) has distinct 
eigenvalues. So applying Proposition 2.4’ (at the end of §2) and the remark 
following it (or using the fact remarked in Chapter 2, §3 that this condition 
implies that pọ does not occur as the residual representation associated to any 
form which has the special representation at q) we see that after tensoring over 
W(kmg,) with O, the right-hand side of (3.6) can be replaced by TO fq} thus 
giving 
Tor/tq ~ TQ tq}: 


since (tg) — 1 € Gg. Repeated inductively this gives the desired relation 
TgQ/ag ~ T, and completes the proof of the lemma. oO 
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Suppose now that Q is a finite set of primes chosen as in the lemma. Recall 
that from the theory of congruences (Prop. 2.4’ at the end of §2) 


ne,,,/nv,¢ = [[(@-9), 
qEQ 
2 


the factors (af — (q)) being units by our hypotheses on q € Q. (We only need 


that the right-hand side divides the left which is somewhat easier.) Also, from 
the theory of Fitting ideals (see the proof of (2.44)) 


#(pr/p>) > #(O/nz,) 


#(pt9/PT.) 2 #(O/nrg,)- 
We deduce that : | 
#Kg = # g I] (q — n) . ect 
qEQ 
where t = #(pr/p4)/#(0/nr,f). Since the range of io has order given by 


#1o/ Than}. 


we compute that the index of the image of vg is < t as lg is injective. 

Keeping our assumption on Q from Lemma 3.2, consider the kernel of AM 
applied to the diagram at the beginning of the proof of the theorem. Then 
with M chosen large enough so that AM annihilates py / pa (which is finite 
because T is reduced) we get: 


0 — HE(Qs/Q, VAM) — HB, (Qzug/Q VAM) 22 [[ PHARE, VO AMER 
qEQ 


t T ve T tQ 


0 — (pp/pa,)” — (Pro /PE PY] — Ka)™] > (pr /p2,)*. 


See (1.7) for the justification that AM can be taken inside the parentheses in 
the first two terms. Let XQ = Ya((PT,> [PTa] [AM]). Then we can estimate 
the order of 6g(Xq) using the fact that the image of 1g has index at most t. 
We get 


(3.7) #5Q(XQ) 2 (1 #0O/(A”,q- D) - (17t) + (1/#(pr/Pp)). 


qEQ 
Now we choose Q to be a set of primes with the property that 
(3.8) eq: Hi» (Qs/Q, Viv) > [[ H (Qa, Všm) 


qEQ 
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is injective. We also keep the condition that LQ is injective by only allowing 
Q to contain primes of the form given in the lemma. In addition, we require 
these q’s to satisfy q = 1(p*). 


To see that this can be done, suppose that x € kereg and Ar = 0 but 
x #0. We have a commutative diagram 


H!(Qz/Q, V) A] £, TT BQ, Vite) [A] 
qEQ 
h h 


H'(Qs/Q, Vx) fo [EQ Vx) 
qEQ 
the right-hand isomorphisms coming from our particular choices of q’s and the 
left-hand isomorphism from our hypothesis on pọ. The same diagram will hold 
if we replace Q by Qo = QU {qo} and we now need to show that we can choose 
qo so that Eg, (x) Æ 0. 
The restriction map 


H!(Qs/Q, V;*) > Hom(Gal(Q/Ko(&)), Vit) Ce Ko(Ge)/Q) 


has kernel H!(Ko(¢p)/Q, k(1)) by Proposition 1.11 where here Ko is the split- 
ting field of pọ. Now if x € H'(Ko(¢,)/Q,k(1)) and z Æ 0 then p = 3 and z 
factors through an abelian extension L of Q(¢3) of exponent 3 which is non- 
abelian over Q. In this exceptional case, L must ramify at some prime q of 
Q(¢3), and if q lies over the rational prime q Æ 3 then the composite map 


H! (Ko(&)/Q, k(1)) — H*(Q7™, k(1)) — H (Q7, (O/A")(1)) 


is nonzero on x. But then z is not of type D* which gives a contradiction. This 
only leaves the possibility that L = Q(¢3, 3V3) but again this means that z is 
not of type D* as locally at the prime above 3, L is not generated by the cube 
root of a unit over Q3(¢3). This argument holds whether or not D is minimal. 

So x, which we view in ker &g, gives a nontrivial Galois-equivariant ho- 
momorphism fs E€ Hom(Gal(Q/Ko(¢,)), V;*) which factors through an abelian 
extension M, of Ko(¢,) of exponent p. Specifically we choose M, to be the © 
-= minimal such extension. Assume first that the projective representation ĝo 
associated to po is not dihedral so that Sym? po is absolutely irreducible. Pick 
a o E€ Gal(M;(¢,™)/Q) satisfying 


(3.9) (i) po(o) has order m > 3 with (m, p) = 1, 
(i) ø fixes Q(det po) (Gm), 
Gii) fz(o™) #0. 


To show that this is possible, observe first that the first two conditions can 
be achieved by Lemma 1.10(i) and the subsequent remark. Let cı be an el- 
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ement satisfying (i) and (ii) and let a denote its image in Gal(Ko(¢,)/Q). 
Then (61) acts on G = Gal(M,/Ko(¢p)) and under this action G decom- 
poses as G ~ G1 @ G} where cı acts trivially on G1 and without fixed points 
on G}. If X is any irreducible Galois stable k-subspace of f,(G) QF, k then 
ker(o; — 1)|x # 0 since Sym? po is assumed absolutely irreducible. So also 
= ker(o1 — 1)|7,(@) # 0 and thus we can find T € Gi such that f,(r) # 0. 
Viewing T as an element of G we then take 


Tı =T X 1 € Gal(Mz(G,m)/Ko(Gp)) > G x Gal(Ko(¢,™ )/Ko(¢p)) 


(This decomposition holds because M, is minimal and because Sym? po and 
Hp are distinct from the trivial representation. ) Now 7, commutes with o; and 
either fz ((71.01)™) # 0 or fe(oT) # 0. Since po(T101) = po(o1) this gives 
(3.9) with at least one of o = mic or o = oj. We may then choose qo so that 
Frob qo = ø and we will then have g(x) # 0. Note that conditions (i) and (ii) 
imply that go = 1(p) and also that po(c) has distinct eigenvalues, thus giving 
both the hypotheses of Lemma 3.2. 
If on the other hand fp is dihedral then we pick o’s satisfying 


(i) po(o) #1, 

(ii) o fixes Q(¢,u), 

(iii) fe (o™) #0, 
with m the order of po(a) (and p ¢ m since fp is dihedral). The first two condi- 
tions can be achieved using Lemma 1.12 and, in addition, we can assume that 
o takes the eigenvalue 1 on any given irreducible Galois stable subspace X | 
of W, k. Arguing as above, we find a r € G such that fz(T) # 0 and 
we proceed as before. Again, conditions (i) and (ii) imply the hypotheses of 
Lemma 3.2. So by successively adjoining g’s we can assume that Q is chosen 
so that eq is injective. 


We have thus shown that we can choose Q = {q,... ,qr} to be a finite 
set of primes q; = 1(p™) satisfying the hypotheses of Lemma 3.2 as well as the 
injectivity of €q in (3.8). By Proposition 1.6, the injectivity of EQ implies that 


(3.10) # Hp(Qzug/Q, VA] = hoo: J] ha 


qELUQ 


Here we are using the convention explained after Proposition 1.6 to define Hp. 
Now, as D was chosen to be minimal, hg = 1 frq E © —{p} by Proposi- 
tion 1.8. Also, hg = #(O/A™)? for q € Q. If - is str or fl then holy = 1 
by Proposition 1.9 (iv) and (v). If - is Se, hoohp < cp by Proposition 1.9 (iii). 
(To compute this we can assume that J, acts on we via w, as otherwise we 
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get hohp < 1. Then with this hypothesis, (WY, )* is easily verified to be un- 
ramified with Frob p acting as U2(p)~* by the description of p;,,|p, in [Wil, 
Th. 2.1.4].) On the other hand, we have constructed classes which are ramified 
at primes in Q in (3.7). These are of type Dg. We also have classes in 


Hom(Gal(Qsug/Q),0/A™”) = H1(Qsug/Q, O/”) > H! (Qsug/Q, Vym) | 


coming from the cyclotomic extension Q(g, ... Cg). These are of type D and 
disjoint from the classes obtained from (3.7). Combining these with (3.10) 
gives — 


# Hd(Qs/Q, V[A™]) < t - #(pr/pp) - op 


as required. This proves part (i) of Theorem 3.1. 

Now if we assume that T is a complete intersection we have that t = 1 
by Proposition 2 of the appendix. In the strict or flat cases (and indeed in 
all cases where cp = 1) this implies that Rp ~ Tp by Proposition 1 of the 
appendix together with Proposition 1.2. In the Selmer case we get 


(3.11) #(pr/p>) - cp = # (O/nT slop = #(C/n0p,,) < # PTD /P rp) 


where the central equality is by Remark 2.18 aud the right-hand inequality 
is from the theory of Fitting ideals. Now applying part (i) we see that the 
inequality in (3.11) is an equality. By Proposition 2 of the appendix, Tp is 
also a complete intersection. 

The final assertion of the theorem is proved in exactly the same way on 
noting that we only used the minimality to ensure that the h,’s were 1. In 
general, they are bounded independent of M and easily computed. (The only 
point to note is that if pz is of multiplicative type at q then py, alpD, does not 
split.) | O 


Remark. The ring Tp, defined in (3.1) and used in this chapter should 
be the deformation ring associated to the following deformation problem Do. 
One alters D only by replacing the Selmer condition by the condition that the 
deformations be flat in the sense of Chapter 1, i.e., that each deformation p 
of po to GL2(A) has the property that for any quotient A/a of finite order, 
p|p, mod a is the Galois representation associated to the Q,-points of a finite 
` flat group scheme over Zp. (Of course, po is ordinary here in contrast to our 
usual assumption for flat deformations. ) 


From Theorem 3.1 we deduce our main results about representations by 
using the main result of [TW], which proves the hypothesis of Theorem 3.1 
(ii), and then applying Theorem 2.17. More precisely, the main result of [TW] 
shows that T is a complete intersection and hence that t = 1 as explained 
above. The hypothesis of Theorem 2.17 is then given by Theorem 3.1(i), 
together with the equality t = 1 (and the central equality of (3.11) in the 
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Selmer case) and Proposition 1.2. Strictly speaking, Theorem 1 of [TW] refers 
to a slightly smaller class of D’s than those covered by Theorem 3.1 but up to 
a twist every such D is covered. It is straightforward to see that it is enough 
to check Theorem 3.3 for po up to a suitable twist. 


THEOREM 3.3. Assume that po is modular and absolutely irreducible 
pol 
when restricted to Q | y (—1) 2 p). Assume also that po is of type (A), (B) 
or (C) at eachq#pin&. Then the map yp: Rp — Tp of Conjecture 2.16 
is an isomorphism for all D associated to po, i.e., where D = (-,4,0,M) with 
- = Se,str,fl or ord. In particular if - = Se, str or fl and f is any newform 
for which pf is a deformation of po of type D then 


# Hp(Qz/Q, Vs) = # (O/np,s) < œ% 
where np s is the invariant defined in Chapter 2 prior to (2.43). 


The condition at q Æ p in È ensures that there is a minimal D associated 
to po. The computation of the Selmer group follows from Theorem 2.17 and 
Proposition 1.2. Theorem 0.2 of the introduction follows from Theorem 3.3, 
after it is checked that a twist of a pọ as in Theorem 0.2 satisfies the hypotheses- 
of Theorem 3.3. | 


Chapter 4 


In this chapter we give a different (and slightly more general) derivation 
of the bound for the Selmer group in the CM case. In the first section we 
estimate the Selmer group using the main theorem of [Ru 4] which is based on 
Kolyvagin’s method. In the second section we use a calculation of Hida to relate 
the 7-invariant to special values of an L-function. Some of these computations 
are valid in the non-CM case also. They are needed if one wishes to give the 
order of the Selmer group in terms of the special value of an L-function. 


1. The ordinary CM case 


In this section we estimate the order of the Selmer group in the ordinary 
CM case. In Section 1 we use the proof of the main conjecture by Rubin to 
bound the Selmer group in terms of an L-function. The methods are standard 
(cf. [de Sh]) and some special cases have been described elsewhere (cf. [Guo]). 
In Section 2 we use a calculation of Hida to relate this to the 7-invariant. 

We assume that | 


(4.1) : p = ndÊ k: Gal(Q/Q) — GL2(O) 
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is the p-adic representation associated to a character « : Gal(L/L) > O% of 
an imaginary quadratic field L. We assume that p is unramified in L and that « 
factors through an extension of L whose Galois group has the form A ~ Z,®T 
where T is a finite group of order prime to p. The ring O is assumed to be the 
ring of integers of a local field with maximal ideal À and we also assume that 
p is a Selmer deformation of pọ = pmod A which is supposed irreducible with 
det po|7, = w.-In particular it follows that p splits in L, p = pp say, and that 
precisely one of x, &* is ramified at p («* being the character r — K(oTo~') 
for any o representing the nontrivial coset in Gal(Q/Q) I Gal(Q/L)). We can 
suppose without loss of generality that « is ramified at p. 

We consider the representation module V ~ (K/O)* (where K is the field 
of fractions of ©) and the representation is via Ad p. In this case V splits as 


V ~Y @(K/O)(~) 6 K/O 


where w is the quadratic character of Gal(Q/Q) associated to L. We let = 
denote a finite set of primes including all those which ramify in p (and in 
particular p). Our aim is to compute H3,(Qs/Q, V). The decomposition of 
V gives a corresponding decomposition of H'(Qs/Q, V) and we can use it to 
define H4 (Qz/Q, Y). Since W? c Y (see Chapter 1 for the definition of W?) 
we can define Hg.(Qy,/Q, Y) by 


Hg.(Qz/Q, Y) = ker{H*(Qz/Q,Y) > H*(Q>™, Y/W°)}. 


Let Y* be the arithmetic dual of Y, i.e., Hom(Y, ppo) @ Qp/Zp. Write 
v for ke/k* and let L(v) be the splitting field of v. Then we claim that 
Gal(L(v)/L) ~ Zp ® T’ with T” a finite group of order prime to p. For this 
it is enough to show that x = K«*/e factors through a group of order prime 
to p since v = k*y~!. Suppose that y has order m = mop" with (mo, p) = 1. 
Then x’ extends to a character of Q which is then unramified at p since the 
same is true of x. Also it factors through an abelian extension of L with Galois 
group isomorphic to Z2 since x factors through such an extension with Galois 
group isomorphic to Z5 ® Tı with T; of order prime to p (the composite of the 
splitting fields of k and «*). It follows that x™? is also unramified outside p, 
whence it is trivial. This proves the claim. 7 

Over L there is an isomorphism of Galois modules 


Y* ~ (K/O)(v) 6 (K/O)(v~*e"). 
In analogy to the above we define H4. (Qg/ Q, Y*) by 
H3_(Qz/Q, ¥*) = ker{H"(Qz/Q,¥*) > HHQ", (W°)*)}. 


Analogous definitions apply if Y* is replaced by Yn. Also we say informally 
that a cohomology class is Selmer at p if it vanishes in H LOR: (W°)*) (resp. 
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H*(Q3"", (Wxn)*)). Let Mo. be the maximal abelian p-extension of L(v) un- 
ramified outside p. The following proposition generalizes [CS, Prop. 5.9]. 


PROPOSITION 4.1. There is an isomorphism 
Hinr(Qs/Q, Y*) 2 — Hom (Gal(M,,/L(v)), (K/O)(v))GXl@)/2) 


where Hi. denotes the subgroup of classes which are Selmer at p and unram- 
ified everywhere else. 


Proof. The sequence is obtained from the inflation-restriction sequence as 
follows. First we can replace H!(Qs/Q, Y*) by 


| | = A 
{H (Qs/L, (K/O)(v)) © H' (Qs/L, (K/O)(v-1e?)) } 
where A = Gal(L/Q). The unramified condition then translates into the 
requirement that the cohomology class should lie in 
{Hane in 2-p(Qz/L, (K/O)(v)) ® Hinr in n-p (Qu/L, (K/O)(v-1e?)) } 


Since A interchanges the two groups inside the parentheses it is enough to 
compute the first of them, i.e., 


(4.2) Hinr in D—p (Qs/L, k/O(v)) 
The inflation-restriction sequence applied to this gives an exact sequence 
(4.3) 0 — Him in sp (L(v)/L, (K/O)(v)) 


= M inr in O—p (Qs/L, (A/O)(v)) | 
+ Hom (Gal(Moo/L(v)), (K/O)(v))SM#EO/) , 


The first term is zero as one easily checks using the divisibility of (K/O)(v). 
Next note that H? (L(v)/L, (K/O)(v)) is trivial. If v Æ 1(A) this is straight- 
forward (cf. Lemma 2.2 of [Rul]). If v = 1(A) then Gal (L(v)/L) = Zp and so 
it is trivial in this case also. It follows that any class in the final term of (4.3) 
lifts to a class c in H+ (Qs/L, (K/O)(v)). Let Lo be the splitting field of Y-. 
Then M..Lo/Lo is unramified outside p and Lo/L has degree prime to p. It 
follows that c is unramified outside p. O 


Now write Hå (Qz/Q, Y,*) (where Y,* = Y;*, and similarly for Yņ) for 


the subgroup of H1,.(Qs/Q, Yn) given by 
| Hy (Qz/Q, Ya) = fo € H abe (Qs/Q, Ye Qp = 0 in H’ (Qp, Yn) )\ 


where (Y,*)° is the first step in the filtration under Dp, thus equal to (Y;,/Y,°)* 
or equivalently to (Y*)9, where (Y*)° is the divisible submodule of Y* on 


which the action of J, is via e°. (If p Æ 3 one can characterize (Y,*)° as the 
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maximal submodule on which I, acts via e°.) A similar definition applies with | 
Y, replacing Y,*. It follows eon an examination of the action of Ip on Y) that 


(4.4) | H4 (Qz/Q, Yn) = Hinr(Qs/Q, Y; n): 


In the case of Y* we will use the inequality 


(4.5) # H4. (Qz/Q, Y*) < # Hla(Qz/Q, Y "Y: 
We also need the fact that for n sufficiently large the map 


(4.6) H3..(Qs/Q, Yn) =R H4 (Qz/Q, Y*) 
is injective. One can check this by replacing these groups by the subgroups 
of H1(L,(K/O)(v),n) and H'(L,(K/O)(v)) which are unramified outside p 
and trivial at p*, in a manner similar to the beginning of the proof of Proposi- 
tion 4.1. The above map is then injective whenever the connecting homomor- 
phism 
H? (Lys, (K/O)(v)) > H' (Lpr, (K/O)(v)n) 

is injective, which holds for sufficiently large n. 

Now, V ae 1.6, 


Hi,.(Qs/Q, Y, a) # H? (Q, Ya) 
E Yn) # H? (Q, Ya) 


Also, H® iat Y„) = 0 and a simple calculation shows that 
l inf #(O/1—v(q)) ifv=1modaA 
1 


(4.7) = # H” (Qp, (¥2)") 


# H(Q, Yz)= 


otherwise 


where q runs through a set of primes of Oz prime to pcond(v) of density one. 
This can be checked since Y* = Ind (v) g K/O. So, setting 


(4.8) le l ~ #(O/(1 — v(a))) i Go 7 i 
we get 
(4.9) 
# Hg.(Qz/Q, Y) a= -JI 4: # Hom (Gal (Mœ/L(v)), (KJO) (vyj EE) 


t oo 
where ¢, = # H°(Q,, Y*) for q p. p, ly = lim # H"(Qp, (Y,°)*). This follows 
from Proposition 4.1, (4.4)—-(4.7) and the elementary estimate | 


(4.10) #(H2,(Qn/Q,Y)/H,(Qn/Q,Y))< JT] k 


qex—{p} 


which follows from the fact that #H*(Q?™, y JEQQ) = 4. 
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Our objective is to compute Hf,(Qs/Q, V) and the main problem is to es- 
_ timate Hå (Qgz/Q,Y). By (4.5) this in turn reduces to the problem of estimat- 
ing 

# Hom(Gal(M,./L(v)), (K/O)(v))S#4)/4)). This order can be computed 
` using the ‘main conjecture’ established by Rubin using ideas of Kolyvagin. (cf. 
[Ru2] and especially [Ru4]. In the former reference Rubin assumes that the 
class number of L is prime to p.) We could now derive the result directly from 
this by referring to [de Sh, Ch. 3], but we will recall some of the steps here. 

Let w; denote the number of roots of unity Ç of L such that ¢ = 1modf 
(f an integral ideal of Oz). We choose an f prime to p such that w; = 1. 
Then there is a grossencharacter y of L satisfying y((a)) = a for a = 1modf 
(cf. [de Sh, II.1.4]). According to Weil, after fixing an embedding QÑ — Qp we 
can associate a p-adic character Yp to (cf. [de Sh, II.1.1 (5)]). We choose 
an embedding corresponding to a prime above p and then we find YPp = KX 
for some x of finite order and conductor prime to p. Indeed p and «K are 
both unramified at p* and satisfy y,|;, = K|;, = € where € is the cyclotomic 
character and J, is an inertia group at p. Without altering f we can even choose 
y so that the order of x is prime to p. This is by our hypothesis that « factored 
through an extension of the form Z, ®T with T of order prime to p. To see 
this pick an abelian splitting field of pp and « whose Galois group has the form 
GGC with G a pro-p-group and G’ of order prime to p. Then we see that 
p|e has conductor dividing fp. Also the only primes which ramify in a Zp- 
extension lie above p so our hypothesis on « ensures that «|g has conductor 
dividing fp. The same is then true of the p-part of x which therefore has 
conductor dividing f. We can therefore adjust p so that x has order prime 
to p as claimed. We will not however choose y so that x is 1 as this would 
require fp™ to be divisible by cond x. However we will make the assumption, 
by altering f if necessary, but still keeping f prime to p, that both v and Pp 
have conductor dividing fp°°. Thus we replace fp% by l.c.m.{f, cond v}. 

The grossencharacter y (or more precisely p o N F/L) is associated to a 
(unique) elliptic curve E defined over F = L(f), the ray class field of conductor 
f, with complex multiplication by Oz and isomorphic over C to C/O; (cf. 
[de Sh, II. Lemma 1.4]). We may even fix a Weierstrass model of E over Or 
which has good reduction at all primes above p. For each prime % of F above 
p we have a formal group Eg, and this is a relative Lubin-Tate group with 
respect to Fp over Ly (cf. [de Sh, Ch. II, §1.10]). We let A = Af, be the 
logarithm of this o group. 

Let U be the product of the principal local units at the primes above p 


of L(fp™); i 


Uss = I] Ooo, where U~% = lim Un, sp, 
Plp | 
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each Un being the principal local units in L(fp”)y. (Note that the ers 
-of L(f) above p are totally ramified in L(fp°°) so we still call them {$}.) W 
wish to define certain homomorphisms 6; on Uso. These were first nek 
in [CW] in the case where the local field Fy is Qp. 

Assume for the moment that Fy is Qp. In this case Êg is isomorphic to 
the Lubin-Tate group associated to mz + x? where m = y(p). Then letting wn 
be nontrivial roots of [x”|(x) = 0 chosen so that [r](wn) = wn-1, it was shown 
in [CW] that to each element u = lim Un E€ Uo, there corresponded a unique 
power series f,(T’) € Z,[T]* such that fu(wn) = Un for n > 1. The definition 
of kp (k > 1) in this case was then 


ôk plu) = (sa oy log fu (T) 


It is easy to see that p gives a homomorphism: Ux. — Ux, —> O, satisfying 
fr ple?) = O(c)" bk.p(€) where 0: Gal (F/F) — OX is the character giving 
the action on E|p™]. 

The construction of the power series in [CW] does not extend to the case. 
_ where the formal group has height > 1 or to the case where it is defined over 
an extension of Qp. A more natural approach was developed by Coleman [Co] 
which works in general. (See also [Iw1].) The corresponding generalizations of 
k were given in somewhat greater generality in [Ru3] and then in full generality 
by de Shalit [de Sh]. We now summarize these results, thus returning to the 
general case where Fẹ is not assumed to be Qp. . 

To an element u = lim Un € Ua we can associate a power series fu,p(T) € 
Ox|[T]]* where Og is the ring of integers of Fp; see [de Sh, Ch. IT 84.5]. (More 
precisely fu,p(T) is the -component of the power series described there.) For 
P we will choose the prime above p corresponding to our chosen embedding 
Q — Q,. This power series satisfies unp = (fu,p)(Wn) for all n > 0, n = 0(d) 
where d = [Fy: Lp] and {wn} is chosen as before as an inverse system of r” 
division points of Ey. We define a homomorphism 6; : U» — Og by 


T=0 


k 
(4.11) klu) : = ôk plu) = (z Y T) 5) log fu,g(T) 
l T=0 
Then 
(4.12) by (u7) = O(r)* 6, (u) for r € Gal(F/F) 


where 0 again denotes the action on E[p~]. Now 0 = pp on Gal(F'/F). We 
actually want a homomorphism on U, with a transformation property corre- 
sponding to v on all of Gal(L/L). Observe that v = y? on Gal(F/F). Let S 
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be a set of coset representatives for Gal(L/L)/ Gal (L/F) and define 
(4.13) =y. v~t (o)êz(u”) € Ox|v]. 
oES 

.Each term is independent of the choice of coset representative by (4. 8) and it 
is easily checked that | 

(u7) = v(o)®o(u). 
It takes integral values in Ox|[v]. Let Us(v) denote the product of the groups 
of local principal units at the primes above p of the field L(v) (by which we 
mean projective limits of local principal units as before). Then 2 factors 
through U,.(v) and thus defines a continuous homomorphism 


| 2: Uo (v) Cp. 
Let Cæ be the group of projective limits of elliptic units in L(v) as defined 


in [Ru4]. Then we have a crucial theorem of Rubin (cf. [Ru4], [Ru2]), proved 
using ideas of Kolyvagin: 


THEOREM 4.2. There is an equality of characteristic ideals as A = 
Zp{|Gal(L(v)/L)]|-modules: 


char (Gal (M.,/L(v))) = char, (Ugo(v)/Coo)- 


Let vo = v mod à. For any Zp[Gal(L(vo) /L)|-module X we write X™%) 
for the maximal quotient of X g O on which the action of Gal(L(vo)/L) is via 


p 
the Teichmüller lift of vo. Since Gal(L(v)/L) decomposes into a direct product 
of a pro-p group and a group of order prime to p, 
Gal(L(v)/L) ~ Gal(L(v)/L(vo)) x Gal(L(vo)/L), 
we can also consider any Zp|[Gal(L(v)/L)]]-module also as a Zp[Gal(L(vo)/L)]- 
module. In particular X°) is a module over Zp[Gal(L(vp)/L)]) ~ O. Also 
AM) ~ O[[T]]. 

Now according to results of Iwasawa (|[Iw2, §12], [Ru2, Theorem 5.1]), 
Uso(v)) is a free A®)-module of rank one. We extend ®2 O-linearly to 
Ux(v) ®z, O and it then factors through U.o(v )(%0). Suppose that u is a 
generator of U,.(v)‘”°) and 8 an element of CY) Then f(y—1)u = 8 for some 
f(T) € O[[T]] and y a topological generator of Gal (L(v)/L(vp)). Computing 
ə on both u and @ gives 


(4.14) © FE) — 1) = 82(8)/82(u). 
Next we let e(a) be the projective limit of elliptic units in lim lim Lyon for 


a some ideal prime to 6fp described in [de Sh, Ch. II, §4.9]. Then by the 
proposition of Chapter II, §2.7 of [de Sh] this is a 12t? power in lim Lin- We 
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let 61 = B(a)!/!? be the projection of e(a)!/!2 to U» and take B = Norm 6; 
where the norm is from L,,0 to L(v). A generalization of the calculation in 
[CW] which may be found in [de Sh, Ch. II, §4.10] shows that 


(4.15) ®2(G) = (root of unity) Q7? (Na — v(a)) L;(2, 0) € Og[v} 


where {2 is a basis for the O;-module of periods of our chosen Weierstrass model 
of Ep. (Recall that this was chosen to have good reduction at primes above p. 
The periods are those of the standard Neron differential.) Also v here should 
be interpreted as the grossencharacter whose associated p-adic character, via 
the chosen embedding Q — Qp, is v, and T is the complex conjugate of v. 

The only restrictions we have placed on f are that (i) f is prime to p; 
(ii) w; = 1; and (iii) condv | fp. Now let fop®© be the conductor of v with fo 
prime to p. We show now that we can choose f such that L;(2,7)/L;,(2,) is 
a p-adic unit unless vọ = 1 in which case we can choose it to be t as defined 
in (4.4). We can clearly choose L,(2,7)/L;,(2,7) to be a unit if v Æ 1, as 
V(q)v(q) = Normg? for any ideal q prime to fop. Note that if vo = 1 then also 
p = 3. Also if vy = 1 then we see that | 


inf # { O/{Lrog(2,7)/Ly(2,0)}} = t 
since Ve? = vy. | 
We can compute ®2(u) by choosing a special local unit and showing that 
$2(u) is a p-adic unit, but it is sufficient for us to know that it is integral. Then 
since Gal (M.,./L(v)) has no finite A-submodule (by a result of Greenberg; see 
[Gre2, end of §4]) we deduce from Theorem 4.2, (4.14) and (4.15) that 


# Hom(Gal (M,./L(v)), ( KIOU TH 


2 l #0O/Q Lp (2,0) if vo #1 
T | (#0/Q~7L,,(2,7))-t if vy =1. 


2 


Combining this with (4.9) gives: 
# Hge(Qu/Q, Y) < # (0/27 L5(2,2)) -T] 4 


qEX 


where 44 = # H?(Qu, Y*) (for q Æp), lp = # H? (Qr, (Y°)*). 
Since V ~ Y @ (K/O) (Yy) & K/O we need also a formula for 


# ker{ H'(Qs/Q, (K/O)(Y) © K/O) + HQ, (K/O)W) @ K/O)\. 


This is easily computed to be 


(4.16) #(O/hr)- J h 


qEX-—{p} 
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where 4; = #H°(Q,, ((K/O)(w) 6 K/O)*) and hz is the class number of O,. 
Combining these gives: 


PROPOSITION 4.3. 


#H3.(Qs/Q,V) < #(O/Q~* Ly, (2,0)) -#(O/hz) - [| & 


qed 


where Ly = #H"(Qy,V*) (for q # p), fp = #H(Qy,(Y°)*). 


2. Calculation of 7 


We need to calculate explicitly the invariants np, introduced in Chapter 2, 
§3 in a special case. Let po be an irreducible representation as in (1.1). Suppose 
that f is a newform of weight 2 and level N, À a prime of O+ above p and p f,r a 
deformation of pọ. Let m be the kernel of the homomorphism T;(N) — Of/d 
arising from f. We write T for T1(N)m "i O, where O = Of) and km is 


the residue field of m. Assume that p { N. We assume here that k is the 
residue field of O and that it is chosen to contain km. Then by Corollary 1 of 
Theorem 2.1, T1(V)m is Gorenstein and it follows that T is also a Gorenstein 
O-algebra (see the discussion following (2.42)). So we can use perfect pairings 
(the second one T-bilinear) 


0x0 — O, hke TXT 0 


to define an invariant 7 of T. If 7: T — © is the natural map, we set 
(n) = (#(1)) where 7 is the adjoint of m with respect to the pairings. It is 
well-defined as an ideal of T, depending only on m. Furthermore, as we noted 
in Chapter 2, §3, m(n) = (n, n) up to a unit in O and as noted in the appendix 
n = Annp = T[|p] where p = kerm. We now give an explicit formula for 7 
developed by Hida (cf. [Hi2] for a survey of his earlier results) by interpreting 
(, ) in terms of the cup product pairing on the cohomology of X,(N), and 
then in terms of the Petersson inner product of f with itself. The following 
account (which does not require the CM hypothesis) is adapted from [Hi2] and 
we refer there for more details. | 
Let 


(4.17) (, ): H! (X(N), Of) x H! (Xi(N), Of) > OF 


be the cup product pairing with Oş as coefficients. (We sometimes drop the 
C from Xı(N);c or Ji(N) co if the context makes it clear that we are re- 
ferring to the complex manifolds.) In particular (t,7,y) = (z,t*y) for all 
x,y and for each standard Hecke correspondence t. We use the action of t on 
H'(X\(N), Of) given by x + t*x and simply write tx for t*x. This is the same 
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as the action induced by tą € Ti(N) on H'(Ji\(N),Of) ~ H'(Xi(N), Of). 
Let pf be the minimal prime of Tı(N) @ Oş associated to f (i.e., the kernel of 
Ti(N) @O — Of given by tı 8 8 — Balf) where tf = q(f)f), and let 


Ly = H'(X(N), Of) [pz]. 


If f = Nang” let fP = Lang”. Then f’ is again a newform and we define 
Lye by replacing f by f? in the definition of Ly. (Note here that Os = Ofo 
as these rings are the integers of fields which are either totally real or CM by 
a result of Shimura. Actually this is not essential as we could replace Oy by 
any ring of integers containing it.) Then the pairing (, ) induces another by 
restriction 


(4.18) | (,): LyX Lye > Of. 


Replacing Oş (and the Or-modules) by the localization of Oş at p (if necessary) 
we can assume that Ly and Lfe are free of rank 2 and direct summands as 
O;-modules of the respective cohomology groups. Let 61,62 be a basis of Ly. 
Then also 61,69 is a basis of L fe = Ly. Here complex conjugation acts on 
H'(X(N), Of) via its action on Oy. We can then verify that 


(6, ô) = det (6;, 6;) 


is an element of Of (or its localization at p) whose image in Of) is given by 
t(n?) (unit). To see this, consider a modified pairing ( , } defined by 


(4.19) = (2,4) = (z, wey) 


where we is defined as in (2.4). Then (tz, y) = (x, ty) for all z, y and Hecke 
operators t. Furthermore 


det (5;,6;) = det(5;, wed;) = cdet(6,, 5;) 


for some p-adic unit c (in Of). This is because we(Lyo) = Ly and we (Ly) = 
Lfe. (One can check this, for example, using the explicit bases described 
below.) Moreover, by Theorem 2.1, 


H'(Xi(N), Z) ST) Ti(N)m ~ Ti(N)2, 
H'(Xi(N), Of) @rymgo,T ~ T°. 


_ Thus (4.18) can be viewed (after tensoring with O; and modifying it as in 
(4.19)) as a perfect pairing of T-modules and so this serves to compute 7 (n?) 
as explained earlier (the square coming from the fact that we have a rank 2 
module). | | 

To give a more useful expression for (6, 6) we observe that f and f’ can be 
viewed as elements of H! (Xi(N),C) ~ Hdp(Xi(N), C) via f œ f(z)dz, fe rH 
fedz. Then {f, fP} form a basis for Lf 80, C. Similarly {f, f°} form a basis 
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for Lye @o0, C. Define the vectors wı = (f, fP), w2 = (F, f?) and write 
w1 = Cô and w2 = Cé with C € M2(C). Then writing fı = f, fo = f we set 


(w,@) := det((fi, fj) = (6,8) det(CC), 


Now (w, œ) is given explicitly in terms of the (non-normalized) Petersson inner 
product (,): , 
(w, a) = —4(f, P 


where (f, f) = Jota (N) f fdxdy. 

To compute det(C’) we consider integrals over classes in Hy (Xı(N), Of). 
By Poincaré duality there exist classes c1,c2 in Hy(Xi(N), Oy) such that 
det ( Je 6;) is a unit in Of. Hence det C generates the same O;-module as- 


is generated by { det ( Se, fi) } for all such choices of classes (c1, c2) and with 


{fi, fo = ff, fP}. Letting uy be a generator of the Os-module { det ( Se, fi)\ 
we have the following formula of Hida: 


PROPOSITION 4.4. n(n?) = (f, f)*/uy tig x (unit in Ofa). 


Now we restrict to the case where pp = Ind? Ko for some imaginary 
quadratic field L which is unramified at p and some k*-valued character ko 
of Gal(L/L). We assume that po is irreducible, i.e., that ko # Ko, where 
Ko,c(6) = kolotoc) for any o representing the nontrivial coset of 
Gal(L/Q)/Gal(L/L). In addition we wish to assume that po is ordinary and 
det po|z, =w. In particular p splits in L. These conditions imply that, if p is a 
prime of L above p, ko(a) = a! modp on U, after possible replacement of ko 
by Ko,c. Here the U, are the units of L, and since Ko is a character, the restric- 
tion of Ko to an inertia group I, induces a homomorphism on U,. We assume 
now that p is fixed and Ko chosen to satisfy this congruence. Our choice of 
Ko will imply that the grossencharacter introduced below has conductor prime 
to p. 

We choose a (primitive) grossencharacter p on L together with an em- 
bedding Q — Q, corresponding to the prime p above p such that the induced 
p-adic character pp has the properties: , 


(i) ~p mod p = ko (P = maximal ideal of Q,). 


(ii) Yp factors through an abelian extension isomorphic to Zp ® T with T of 
finite order prime to p. 


(iii) y((a)) = a for a = 1(f) for some integral ideal f prime to p. 


To obtain ọ it is necessary first to define yp. Let Mæ denote the maximal 
abelian extension of L which is unramified outside p. Let 6:Gal(M/L) —> 
Q,* be any character which factors through a Z,-extension and induces the 
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homomorphism a ++ a7" on U,1 > Gal(M/L) where U,1 = {u € Up: u = 
1(p)}. Then set pp = Ko, and pick a grossencharacter p such that (~)» = pp- 
Note that our choice of y here is not necessarily intended to be the same as 
the choice of grossencharacter in Section 1. 

Now let fp be the conductor of y and let F be the ray class field of con- 
ductor ee Then over F there is an elliptic curve, unique up to isomorphism, 
with complex multiplication by Oz and period lattice free, of rank one over Or, 
and with associated grossencharacter po Np JL- The curve Ep is the extension 
of scalars of a unique elliptic curve E jF+ where F* is the real subfield of F of 
index 2. (See [Sh1, (5.4.3)].) Over F* this elliptic curve has only the p-power 
isogenies of the form +p™ for m € Z. To see this observe that F is unramified 
at p and po is ordinary so that the only isogenies of degree p over F are the 
ones that correspond to division by ker p and ker p’ where pp’ = (p) in L. Over 
F* these two subgroups are interchanged by complex conjugation, which gives 
the assertion. We let E/o, Sao denote a Weierstrass model over Op+ (p), the 
localization of Or+ at p, with good reduction at the primes above p. Let wg 


be a Neron differential of E Oreo Let Q be a basis for the Oz-module of 


periods of wg. Then 2 = u- Q for some p-adic unit in F*. | 

According to a theorem of Hecke, ọ is associated to a cusp form fọ in such 
a way that the L-series L(s, p) and L(s, fọ) are equal (cf. [Sh4, Lemma 3)]). 
Moreover since y was assumed primitive, f = fy is a newform. Thus the 
integer N = cond f = |A;/Q| Norm; /qQ(cond p) is prime to p and there is a 
homomorphism E 

Wr: Tı(N)>Rf C OF C O, 

satisfying Ys (Tı) = p(c)+¢(c) if l = cc in L, (l { N) and w;(T)) = 0 if l is inert 
in L (l { N). Also ys(i(l)) = v((1))(1) where w is the quadratic character 
associated to L. Using the embedding of Q in Q, chosen above we get a 
prime A of Oş above p, a maximal ideal m of Tı (N) and a homomorphism 
Tı(N)m —> Of), such that the associated representation pf,. reduces to 


po mod A. 
Let po = ker py: Tı (N) — Oş and let 


As = Ji\(N)/po (N) 
be the abelian variety associated to f by Shimura. Over F* there is an isogeny 


Appr ~ (Eps) 
where d = [Of: Z] (see [Sh4, Th. 1]). To see this one checks that the p-adic Ga- 
lois representations associated to the Tate modules on each side are equivalent 
to (IndẸ" Pp) DZ, Kf,p where Kf p = Of ® Qp and where yp: Gal( F/F) > Z% 
is the p-adic character associated to y and restricted to F. (One coniare | 
trace(Frob £) in the two representations for £ + Np and £ split completely in 
F*; cf. the discussion after Theorem 2.1 for the representation on A f+) 
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Now pick a nonconstant map | 
T: Xı(N)jF+ => Ejp+ 


which factors through As;p+. Let M be the composite of F+ and the nor- 
mal closure of Ky viewed in C. Let wg be a Neron differential of Ejo, T 
Extending scalars to M we can write 


a) = > AgW fe, ac E M 
o€Hom(K+,C) 


where wfo = = anl fo) q for each o. By suitably choosing m we can assume 


that aiq £ 0. "Phen there ee Ai € Om and t; E€ T1(NV) such that 
`> AitiT" WE = Ciwf for some cı E€ M. 

We consider the map 

(4.20) n": Hi(Xi(N) 0, Z) 8 Omg) > HilEyc,Z) 8 Om) 


A by m’ = >> A;i(t oti). Even if 7’ is not surjective we claim that the image 
of 7’ always has the form Ai{E/o; Z) ® aOy(p) for some a € Oy. This is 
because tensored with Zp 7’ can be viewed as a Gal(Q/F't)-equivariant map 
of p-adic Tate-modules, a the only p-power isogenies on E;p+ have the form 
+p” for some m € Z. It follows that we can factor 7’ as (1 8 a) oa for some 

other surjective a | 


a: Hi(X1(N)/c, Z) ® Ou > H*(E/c, Z) 8 Om, 


now allowing a to be in Oxy (p). Now define a* on QL IC by a* = Sia + )jtjon* 
where miO ni o3 (N)/C is the map induced by m and t; has the usual 
action on O35, N/C Then a*(wg) = cw for some c E€ M and 


(4.21) f wn) = [ve 


J a(y) 


for any class y € Hi(X1(N)/c,Om). We note that a (on homology as in 
(4.20)) also comes from a map of abelian varieties a: Jı (N) p+ @z Om > 
E;p+ ®z Om although we have not used this to define a* 

We claim now that c € Oy (p): We can compute a TE by considering 
a* (we 81) = Z tin* Qat; on Qe) p+ & Oy and then mapping the image in 
O8(N)/ Ft © Ox to Q} (N)/F+ DO m4 Ou = Q3. (N)/M° Now let us write O1 for 
O r+ (p)- Then there are isomorphisms 


s1 s2 


1 ee 1 ina r =l 
VN) o, 802: ”? Hom(Om, QFN) /0,) = QW) o, 8 ó 
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where 6 is the different of M/Q. The first isomorphism can be described as 
follows. Let e(y): (N) > Ji(N) 8 Om for y E€ Oy be the map xt > © ®@¥. 
Then tı (w)(y) = e(7)*w. Similar identifications occur for E in place of Jı (N). 
So to check that a*(wg 8 1) € 5, (N) jo, ® Om it is enough to observe that by 
its construction a comes from a homomorphism Ji(N)/o,®Om > Eo, g0 M- 
It follows that we can compare the periods of f and of wp. 

For f? we use the fact that f, fPdz = fye f dz where c is the Om-linear 
map on homology coming from complex conjugation on the curve. We deduce: 


PROPOSITION 4.5. uf = zz 9?.(1/(p-adic integer)). 


We now give an expression for (fp, fọ) in terms of the L-function of ¢. 
This was first observed by Shimura [Sh2] although the precise form we want 
was given by Hida. | 


PROPOSITION 4.6. 


otd = ea 8?) TI (1- =) p tw 2y*8) Lu (dv) 


aS 


To 


where x is the character of fọ and x its restriction to L; 
~ ab is the quadratic character associated to L; 
Ly( ) denotes that the Euler factors for primes dividing N have been 
removed; 
S. is the set of primes q | N such that q = qq’ with q ł condy and q, q 
primes of L, not necessarily distinct. 


Proof. One begins with a formula of Petersson that for an eigenform of 
weight 2 on T(N) says 


(ff) = (An)? T (2) (5) 7 [SL2(Z) : T(N) -(41)] - Ressa D(s, f, f°) 


where D(s, f, fP) = L lanl n~: if f = = anq” (cf. [Hi3, (5.13)]). One checks 


n= 
that, removing the Fuler factors at pines dividing N, 


Dn(s, f, f’) = In(s, p 25) In(s — 1; ~)Ca,n(s 7 1)/Cq,n (28 m 2) 


by using Lemma 1 of [Sh3]. For each Euler factor of f at a q | N of the form 
(1—aq q~°) we get also an Euler factor in D(s, f, fP) of the form (1—agāg q~°). 
When f = fọ this can only happen for a split prime q where q’ divides the 
conductor of y but q does not, or for a ramified prime q which does not divide 
the conductor of y. In this case we get a term (1 — g'~*) since |y (q)|? = 
Putting together the propositions of this section we now have a formula for 
a(n) as defined at the beginning of this section. Actually it is more convenient — 
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to give a formula for t(ņnm), an invariant defined in the same way but with 
Ti(M)m, ® O replacing Ti(N)m @® O where M = pMo with p f Mo 
W (km) W (km) 


mi 


and M/N is of the form 


Ha II ¢&. 


qES qat N 
á q| Mo 


Here m; is defined by the requirements that Pm, = po, Ug E mifq| M (q £p) 
and there is an embedding (which we fix) km, — k over ko taking Up — ay 
where ay is the unit eigenvalue of Frobp in pf. So if f’ is the eigenform 
obtained from f by ‘removing the Euler factors’ at q | (M/N) (q # p) and 


removing the non-unit Euler factor at p we have ny = 7#(1) where r : Ti = 
Ti(M)m O — O corresponds to f’ and the adjoint is taken with respect 
W(k 


mI 


to perfect pairings of T) and O with themselves as O-modules, the first one 
assumed 7}-bilinear. 

Property (ii) of yp ensures that M is as in (2.24) with D = (Se, £, ØO, ¢) 
where © is the set of primes dividing M. (Note that Sẹ, is precisely the set of 
primes q for which ng = 1 in the notation of Chapter 2, §3.) As in Chapter 2, 
§3 there is a canonical map 


(4.22) | Rp > Tp 2 Ti(M) 8 O 

| W (km; ) 
which is surjective by the arguments in the proof of Proposition 2.15. Here 
we are considering a slightly more general situation than that in Chapter 2, 
§3 as we are allowing po to be induced from a character of Q(./—3). In this 
special case we define Tp to be T)}(M)m, a The existence of the map 


1 


in (4.22) is proved as in Chapter 2, §3. For the surjectivity, note that for each 
q | M (with q # p) U, is zero in Tp as U, € m; for each such q so that we 
can apply Remark 2.8. To see that U, is in the image of Rp we use that it 
is the eigenvalue of Frob p on the unique unramified quotient which is free of 
rank one in the representation p described after the corollaries to Theorem 2.1 
(cf. Theorem 2.1.4 of [Wi1]). To verify this one checks that Tp is reduced 
or alternatively one can apply the method of Remark 2.11. We deduce that 
U, € TH, the W(km,)-subalgebra of Tı(M)m, generated by the traces, and it 
follows then that it is in the image of Rp. We also need to give a definition of 
Tp where D = (ord, £, O, ¢) and po is induced from a character of Q(./—3). 
For this we use (2.31). 
Now we take 


M=Np || 4. 
qES p 
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The arguments in the proof of Theorem 2.17 show that 
m(nm) is divisible by «(n)(a% — (p))- [[ (@-1) 
qeSy 


where a, is the unit eigenvalue of Frobp in pz). The factor at p is given by 
=- remark 2.18 and at q it comes from the argument of Proposition 2.12 but with 
H = H’ = 1. Combining this with Propositions 4.4, 4.5, and 4,6, we have that 


(4.23) (na) is divisible by Q-?Lw(2, gx) ENC?) (a2 — (p) II (@—-).- 
| q|N 


We deduce: 
THEOREM 4.7. #(O/n(nm)) = # H4 (Qs/Q, V). 


Proof. As explained in Chapter 2, 83 it is sufficient to prove the inequality 
#(O/n(nm)) > #H4.(Qg/Q, V) as the opposite one is immediate. For this it 
suffices to compare (4.23) with Proposition 4.3. Since 


Ly (2,7) = Ln(2,v) = Ln(2, 9X) 


(note that the right-hand term is real by Proposition 4.6) it suffices to pair up 
the Euler factors at q for q | N in (4.23) and in the expression for the upper 
bound of # Hå (Qs/Q, V). o 


We now deduce the main theorem in the CM case using the method of 
Theorem 2.17. 


THEOREM 4.8. Suppose that po as in (1.1) is an irreducible represen- 


tation of odd determinant such that po = Indř ro for a character ko of an 
imaginary quadratic extension L of Q which is unramified at p. Assume also 
that: 


(i) det pol, = Ww; 
(ii) po is ordinary. 
Then for every D = (-,4,0,¢) such that po is of type D with - = Se or ord, 
Rp =~ Tp | 
` and Tp is a complete intersection. 
COROLLARY. For any po as in the theorem suppose that 
p: Gal(Q/Q) + GL2(0) 


is a continuous representation with values in the ring of integers of a local 
field, unramified outside a finite set of primes, satisfying p ~ po when viewed 


as representations to GLa(Fp). Suppose further that: 
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(i) p|_ ts ordinary; 
Dp 
= ye’! with x of finite order, k > 2. 


(ii) det p 
Ip 


Then p is associated to a modular form of weight k. 


Chapter 5 


In this chapter we prove the main results about elliptic curves and espe- 
cially show how to remove the hypothesis that the representation associated 
to the 3-division points should be irreducible. 


Application to elliptic curves 


The key result used is the following theorem of Langlands and Tunnell, 
extending earlier results of Hecke in the case where the projective image is 
dihedral. | 


THEOREM 5.1 (Langlands-Tunnell). Suppose that p:Gal(Q/Q) —> 
GL2(C) is a continuous irreducible representation whose image is finite and 
solvable. Suppose further that det p is odd. Then there exists a weight one 
newform f such that L(s,f) = L(s,p) up to finitely many Euler factors. 


Langlands actually proved in [La] a much more general result without 
restriction on the determinant or the number field (which in our case is Q). 
However in the crucial case where the image in PGL2(C) is S4, the result was 
only obtained with an additional hypothesis. This was subsequently removed | 
by Tunnell in [Tu]. 

Suppose then that 


po: Gal(Q/Q) > GL2(F3) 


is an irreducible representation of odd determinant. We now show, using 
the theorem, that this representation is modular in the sense that over Fs, 
po © Pg, mod u for some pair (g, u) with g some newform of weight 2 (cf. [Se, 
§5.3]). There exists a representation 7 


i: GLo(F3) > GL (Z [v=2]) C GL2(C). 


By composing i with an automorphism of GL2(F3) if necessary we can assume 
that i induces the identity on reduction mod (1 + v-2). So if we consider 
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io po : Gal(Q/Q) — GL2(C) we obtain an irreducible representation which is 
easily seen to be odd and whose image is solvable. Applying the theorem we 
find a newform f of weight one associated to this representation. Its eigenvalues 
lie in Z |V=2 . Now pick a modular form E of weight one such that E = 1(3). 
For example, we can take E = 6 Ei, where E, y is the Eisenstein series with 
Mellin transform given by ¢(s) ¢(s, x) for x the quadratic character associated 
to Q(/—3). Then fE = fmod3 and using the Deligne-Serre lemma ([DS, 
Lemma 6.11]) we can find an eigenform g’ of weight 2 with the same eigenvalues 
as f modulo a prime p’ above (1 + /—2). There is a newform g of weight 2 
which has the same eigenvalues as g’ for almost all T;’s, and we replace (g’, yu’) 
by (g, u) for some prime pu above (1 + ./—2). Then the pair (g, 11) satisfies our 
requirements for a suitable choice of u (compatible with p’). 


We can apply this to an elliptic curve E defined over Q by considering 
_ E|3]. We now show how in studying elliptic curves our restriction to irreducible 
representations in the deformation theory can be circumvented. 


THEOREM 5.2. All semistable elliptic curves over Q are modular. 


Proof. Suppose that E is a semistable elliptic curve over Q. Assume 
first that the representation Jg 3 on E/3] is irreducible. Then if pọ = PE,3 
restricted to Gal(Q/Q (\/—3)) were not absolutely irreducible, the image of the 
restriction would be abelian of order prime to 3. As the semistable hypothesis 
implies that all the inertia groups outside 3 in the splitting field of pọ have 
= order dividing 3 this means that the splitting field of po is unramified outside 
3. However, Q(./—3) has no nontrivial abelian extensions unramified outside 3 
and of order prime to 3. So po itself would factor through an abelian extension 
of Q and this is a contradiction as pọ is assumed odd and irreducible. So 
po restricted to Gal(Q/Q(./—3)) is absolutely irreducible and pg, is then 
modular by Theorem 0.2 (proved at the end of Chapter 3). By Serre’s isogeny 
theorem, F is also modular (in the sense of being a factor of the Jacobian of a 
modular curve). | 

So assume now that Jpg 3 is reducible. Then we claim that the represen- 
tation Jg „s on the 5-division points is irreducible. This is because Xo(15) (Q) 
has only four rational points besides the cusps and these correspond to non- 
semistable curves which in any case are modular; cf. [BiKu, pp. 79-80]. If we 
knew that pg, was modular we could now prove the theorem in the same way 
we did knowing that Jg, was modular once we observe that Jg s restricted to 
Gal(Q/Q(vV/5)) is absolutely irreducible. This irreducibility follows a similar 
argument to the one for pg since the only nontrivial abelian extension of 
Q (v5) unramified outside 5 and of order prime to 5 is Q(¢5) which is abelian 
over Q. Alternatively, it is enough to check that there are no elliptic curves 
E for which Jg is an induced representation over Q(/5) and E is semistable 
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at 5. This can be checked in the supersingular case using the description of 
PE. 5|D; (in particular it is induced from a character of the unramified quadratic 
extension of Qs whose restriction to inertia is the fundamental character of 
level 2) and in the ordinary case it is straightforward. 

Consider the twisted form X(p)/q of X(5)/q defined as follows. Let 
X(5)/Q be the (geometrically disconnected) curve whose non-cuspidal points 
classify elliptic curves with full level 5 structure and let the twisted curve be 
defined by the cohomology class (even homomorphism) in 


H'(Gal(L/Q), Aut X(5)7z) 


given by Jg s: Gal(L/Q) — GLo(Z/5Z) C Aut X(5)/z where L denotes the 
splitting field of Ag 5. Then E defines a rational point on X(p)/qQ and hence 
also of an irreducible component of it which we denote C. This curve C is 
smooth as X(p) gq = X(5 (5)q@ is smooth. It has genus zero since the same is 
true of the irreducible components of X(5) /q. 

A rational point on C (necessarily non-cuspidal) corresponds to an elliptic 
curve E’ over Q with an isomorphism E’[5] ~ E[5] as Galois modules (cf. [DR, 
VI, Prop. 3.2]). We claim that we can choose such a point with the two 
properties that (i) the Galois representation Jg 3 is irreducible and (ii) E’ (or 
a quadratic twist) has semistable reduction at 5. The curve E’ (or a quadratic 
twist) will then satisfy all the properties needed to apply Theorem 0.2. (For the 
primes q Æ 5 we just use the fact that E’ is semistable at q ==> #pg5(Jq) | 5.) 
So E’ will be modular and hence so too will pz’ 5. 

To pick a rational point on C satisfying (i) and (ii) we use the Hilbert irre- 
ducibility theorem. For, to ensure condition (i) holds, we only have to eliminate 
the possibility that the image of fz 3 is reducible. But this corresponds to E’ 
being the image of a rational point on an irreducible covering of C of degree 
4. Let Q(t) be the function field of C. We have therefore an irreducible poly- 
nomial f(z,t) € Q(t)[x] of degree > 1 and we need to ensure that for many 
values tọ in Q, f(z,to) has no rational solution. Hilbert’s theorem ensures 
that there exists a tı such that f(z,t,) is irreducible. Then we pick a prime 
pı Æ 5 such that f(z, t,) has no root mod pı. (This is easily achieved using the 
Cebotarev density theorem; cf. [CF, ex. 6.2, p. 362].) So finally we pick any 
to € Q which is p)-adically close to tı and also 5-adically close to the original 
value of t giving E. This last condition ensures that E’ (corresponding to to) 
or a quadratic twist has semistable reduction at ? - see this, observe that 
since jg #0, 1728, we can find a family E(j): y? = z? — go(j)x — g3(j) with 
rational functions g2(7), g3(j) which are finite at jp and with the j-invariant of 
E(jo) equal to jo whenever the g;(jo) are finite. Then E is given by a quadratic 
twist of E(jg) and so after a change of functions of the form go(j) + u*go(J), 
93(j) — u’g3(j) with u € Q* we can assume that E(jz) = E and that the 
equation E(jg) is minimal at 5. Then for 7’ € Q close enough 5-adically to jz 
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the equation F(j’) is still minimal and semistable at 5, since a criterion for this, 
for an integral model, is that either ords(A(E(j’))) = 0 or ords(c4(E(j’))) = 0. 
So up to a quadratic twist E’ is also semistable. o 


This kind of argument can be applied more generally. 


THEOREM 5.3. Suppose that E is an elliptic curve defined over Q with 
the following properties: 


(i) E has good or multiplicative reduction at 3, 5, 
Gi) For p = 3,5 and for any prime q = —1 mod p either PE p| D, is reducible 
over Fp or PE, p| Iq 18 irreducible over Fp. | 


Then E is modular. 


Proof. The main point to be checked is that one can carry over condi- 
tion (ii) to the new curve E’. For this we use that for any odd prime p £ q, 


PEp|p, is absolutely irreducible and fz,p|1, is absolutely reducible 
and 3 { #PE,p(q) 
i) 


E acquires good reduction over an abelian 2-power extension of 
Q7™ but not over an abelian extension of Qq. 


Suppose then that q = —1(3) and that E’ does not satisfy condition (ii) at 
q (for p = 3). Then we claim that also 3 {| #pe' 3(Iq). For otherwise pz 3 (Iq) 
has its normalizer in GL2(F3) contained in a Borel, whence pg 3(D,) would 
be reducible which contradicts our hypothesis. So using the above equivalence 
we deduce, by passing via pg 5 ~ PE, that E also does not satisfy hypothesis 
(ii) at p = 3. A 
< We also need to ensure that pg 3 is absolutely irreducible over Q(./—3). 
This we can do by observing that the property that the image of pgp 3 lies in the 
Sylow 2-subgroup of GL2(F3) implies that Æ” is the image of a rational point 
on a certain irreducible covering of C of nontrivial degree. We can then argue 
in the same way we did in the previous theorem to eliminate the possibility 
that Pz 3 was reducible, this time using two separate coverings to ensure that 
the image of fz 3 is neither reducible nor contained in a Sylow 2-subgroup. 
Finally one also has to show that if both pz 5 is reducible and pz 3 is 
induced from a character of Q(./—3) then E is modular. (The case where 
both were reducible has already been considered.) Taylor has pointed out 
that curves satisfying both these conditions are classified by the non-cuspidal 
rational points on a modular curve isomorphic to X9(45)/Wo, and this is an 
elliptic curve isogenous to Xo(15) with rank zero over Q. The non-cuspidal 
rational points correspond to modular elliptic curves of conductor 338. O 
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Appendix 


Gorenstein rings and local complete intersections 
| Dik 
PROPOSITION 1. Suppose that O is a complete discrete valuation ring 
and that p: S — T is a surjective local O-algebra homomorphism between com- 
plete local Noetherian O-algebras. Suppose further that pr is a prime ideal of 
T such that T/pr —> O and let ps = y~! (pr). Assume that 


(i) T ~ Ofa,...,2,]/(fi,.--,;fr—u) where r is the size of a minimal set of 
O-generators of pr/p4, 

(ii) y induces an isomorphism ps/p% —> pr/p2 and that these are finitely 
‘generated O-modules whose free part has rank u. | 


Then y is an isomorphism. 


Proof. First we consider the case where u = 0. We may assume that the 
generators z1, ...,Tr lie in pr by subtracting their residues in T/pr —> O. By 
(ii) we may also write 


SO [Picea l/s gs) 


with s > r (by allowing repetitions if necessary) and pg generated by the 
images of {x1,...,2,}. Let p = (£1,...,£r) in Ofz1,...,z,]. Writing f; = 
Xaij£zj mod p? with a;; € O, we see that the Fitting ideal as an O-module of 
pr/pż is given by | 

Fo(pr/p7) = det(aiz) € O 
and that this is nonzero by the hypothesis that u = 0. Similarly, if each 
gi = Xbijzj mod p*, then 


Folps/p4) = {det(b;):i € I, #I =r, IC {1,...,85}}. 


By (ii) again we see that det(a;j) = det(b;;) as ideals of O for some choice Ip 


of J. After renumbering we may assume that Jp = {1,...,r}. Then each g; 
(i =1,...,r) can be written g; = Urj; fi for some ri; € O[x1,...,2,] and we 
have 


det (b;;) = det(r;;) - det(a;;) mod p. 


Hence det(r;;) is a unit, whence (r;;) is an invertible matrix. Thus the f;’s can 
be expressed in terms of the g;’s and so S œT. 

We can extend this to the case u # 0 by picking z1, ...,£r—u so that they 
generate (pr/p2.)'"S. Then we can write each f; = X; Y 32; mod p? and 
likewise for the g;’s. The argument is now just as before but applied to the 
Fitting ideals of (pr/p2.)'°. | o 
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For the next proposition we continue to assume that O is a complete 
discrete valuation ring. Let T be a local O-algebra which as a module is finite 
and free over O. In addition, we assume the existence of an isomorphism of 
T-modules T —+ Homo(T, QO). We call a local O-algebra which is finite and 
free and satisfies this extra condition a Gorenstein O-algebra (cf. §5 of [Til]). 
Now suppose that p is a prime ideal of T such that T/p ~ O. 

Let @:T — T/p ~ O be the natural map and define a principal ideal of T 
by 


(nr) = (6(2)) 


where 3:0 — T is the adjoint of @ with respect to perfect O-pairings on O 
and T, and where the pairing of T with itself is T-bilinear. (By a perfect 
pairing on a free O-module M of finite rank we mean a pairing M x M > O 
such that both the induced maps M — Home(M, ©) are isomorphisms. When 
M =T we are thus requiring that this be an isomorphism of T-modules also.) 
The ideal (77) is independent of the pairings. Also T/nr is torsion-free as an 
O-module, as can be seen by applying Hom ( , ©) to the sequence 


0—>p>T=>=0—0, 


to obtain a homomorphism T/nr — Hom(p, ©). This also shows that (nr) = 
Ann p. 
If we let 1(M) denote the length of an O-module M, then 


\(p/p?) > 1(O/7r) 


(where we write 77 for G(nr)) because p is a faithful T/nr-module. (For a 
brief account of the relevant properties of Fitting ideals see the appendix to 
[MW1].) Indeed, writing FR(M) for the Fitting ideal of M as an R-module, 
we have 


Frj (0) = 0 => Fr(p) C (mr) > Fryp(p/62) C (FR) 


and we then use the fact that the length of an O-module M is equal to the 
length of O/Fo(M) as O is a discrete valuation ring. In particular when p/p? 
is a torsion O-module then fir # 0. 

We need a criterion for a Gorenstein O-algebra to be a complete inter- 
section. We will say that a local O-algebra S which is finite and free over 
O is a complete intersection over O if there is an O-algebra isomorphism 
S ~ O[zi,...,2r]/(fi,..., fr) for some r. Such a ring is necessarily a Goren- 
stein O-algebra and {f,..., fr} is necessarily a regular sequence. That (i) > 
(ii) in the following proposition is due to Tate (see A.3, conclusion 4, in the 
appendix in [M Rol.) | 


PROPOSITION 2. Assume that O is a complete discrete valuation ring 
and that T is a local Gorenstein O-algebra which is finite and free over O and 
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that pr is a prime ideal of T such that T/pr ~ O and pr /p> 1s a torsion 
O-module. Then the following two conditions are equivalent: 


(i) T is a complete intersection over O. 
(ii) (pr/p2.) = 1(O/AF) as O-modules. 
Proof. To prove that (ii) > (i), pick a complete intersection S over Ø (so 
assumed finite and flat over O) such that a: S-+»T and such that ps/p2, ~ pr/p?. 


where ps = a ‘(pr). The existence of such an S seems to be well known 
(cf. [Ti2, §6]) but here is an argument suggested by N. Katz and H. Lenstra 


(independently). 
Write T = Ofai,...,2r]/(fi,..., fs) with pr the image in T of p = 
(v1,...,2,). Since T is local and finite and free over O, it follows that also 


DOU sig te fisnsesJa)s We can pick gi,...,9, such that gi = La; fj 
with ai; E€ O and such that NE 


(Jisses fae) = (g1, -- -3 gr, p°). 


We then modify g1,..., gr by the addition of elements {a;} of (f1,..., fs)” and 
set (gi = 91 +aQ1,...,9) =gr+ar,). Since T is finite over O, there exists an N 
such that for each i, aN can be written in T as a polynomial h,(x1,...,2,) of 
total degree less than N. We can assume also that N is chosen greater than 
the total degree of g; for each i. Set a; = (x — h;j(x1,...,2,))?. Then set 
S = Ofz1,...,27]/(g)---,g).). Then S is finite over O by construction and also 
dim(S) < 1 since dim( S/A) = 0 where (A) is the maximal ideal of O. It follows 
that {g},...,9,} is a regular sequence and hence that depth(S) = dim(S) = 1. 
In particular the maximal O-torsion submodule of S is zero since it is also a 
finite length S-submodule of S. | | 

Now O/(ñs) ~ O/(ñr), since I(0/(ñs)) = I(ps/ps) by (i) = (ii) and 
l(O/(f#r)) = U(pr/p2) by hypothesis. Pick isomorphisms 


T ~Homo(T,O), S ~ Homo(S, 0) 


as T-modules and S-modules, respectively. The existence of the latter for 
complete intersections over ØO is well known; cf. conclusion 1 of Theorem A.3 
of [M Ro]. Then we have a sequence of maps, in which & and @ denote the 
adjoints with respect to these isomorphisms: 


Of 2.6 2.72. o. 


One checks that â is a map of S-modules (T being given an S-action via a) 
and in particular that ao â is multiplication by an element t of T. Now 
(808) = (fr) in O and (80a) 0(Boa) = (ñs) in O. As (ñs) = (ñr) in O, we 


have that ¢ is a unit mod pr and hence that ao4@ is an isomorphism. It follows 
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that S ~ T, as otherwise S ~ kera ®imâ is a nontrivial decomposition as 
S-modules, which contradicts S being local. oO 


Remark. Lenstra has made an important improvement to this proposi- 
tion by showing that replacing 77 by G(annp) gives a criterion valid for all 
local O-algebras which are finite and free over O, thus without the Gorenstein 


hypothesis. 
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